Gene Screen

The invention relates to a screen for the identification of genes which show regulated 
expression in response to carbon source utilisation. 

Colorectal cancer is a cancer which occurs in the large intestine and rectum. The 
colon can be divided into effectively four sections; the ascending colon; the 
transverse colon; the descending colon; and the sigmoid colon. Most colorectal 
cancers arise in the sigmoid colon and develop from "polyps" which can grow for 
several years before becoming cancerous. The early detection of these pre-cancerous
growths is obviously desirable since removal of the polyps is a very effective means 
to stem the progress of disease. 

There are various types of colorectal cancer. Most cancers of this type are 
adenocarcinomas which are malignant growths which begin in the epithelial cells
which line the colon and rectum. Other cancers of the colon and rectum include 
gastrointestinal stromal tumours and lymphomas. In some examples the patient can 
be asymptomatic and for this reason it is important that screening is undertaken to 
identify those patients in which pre-cancerous polyps are forming. However, some 
patients do present with symptoms and these include rectal bleeding, diarrhoea,
constipation, abdominal pain, and general weakness. 

As mentioned above, regular screening is by far the most effective way of controlling
this disease since removal of pre-cancerous polyps by surgery can effectively cure
any disease before it is initiated. Currently, diagnostic tests include colonoscopy,
which allows a doctor to examine the rectum and colon; faecal blood analysis, which
checks for any bleeding from the bowel and rectal area, although this test is not
directly diagnostic for a cancerous lesion in its own right; and sigmoidoscopy, which
is similar to colonoscopy but only investigates the lower bowel area.

Typically, patients with a family history of colorectal cancer can be expected to have
a colonoscopy every 5 years or so and a blood stool check on a yearly basis from
about the age of 40.

The treatment of colorectal cancer usually involves invasive surgery to remove
polyps and/or malignant growths. If the cancer has developed beyond the polyp stage
then more extensive surgery is required which can result in removal of part of the
bowel and surrounding lymph nodes. In the situation where a cancer necessitates
extensive surgery a colostomy stoma may be required, at least for a period, to allow
the bowel to recover from surgery. Surgery in the rectal region is more complicated
and is largely dependent on how far the disease has progressed. In some cases the
surgery can damage nerves which control sexual and urinary functions. In advanced
stage colorectal cancers metastatic lesions may require removal and in about 15% of
cases the lesions are in the liver, which requires removal of large parts of the liver.
The surgical removal of polyps and/or cancerous growths leads to a good prognosis
for patients. In some cases surgery is followed by a course of chemotherapy (for
colon cancer) or chemotherapy and radiation therapy (for rectal cancer) to remove any
cancer cells not detected during surgery. The chemotherapeutic agents typically used
to treat colorectal cancer include 5-fluorouracil, leucovorin, irinotecan and
capecitabine.

It is apparent that the early detection of cells which are pre-cancerous is highly 
desirable since in most cases surgery to remove these cells results in a very good 
prognosis for patients. Diagnostic tests which use the detection of cancer markers as 
an early indicator of cancer are known in the art. 

For example, EP1355149 describes gene expression profiles from colorectal samples 
to provide a "finger print" expression profile as an indication of whether a patient is 
susceptible to the development of colorectal cancer or indeed if malignant growth has 
already been initiated. The disclosure in EP1355149 is directed to the use of 
microarrays to compare transformed and non-transformed tissue gene expression in a
global sense.

WO02/059609 also describes a gene screen which utilises expression profiles in
breast and colorectal cancer. A comparison is made between "normal" and
"abnormal" samples in patients to provide a global picture of gene expression in
these samples as an indicator of particular genes which are either over-expressed or
abrogated between samples. Both EP1355149 and WO02/059609 take a shotgun
approach to screening for target genes which can be used either as a diagnostic tool
or as a target for the development of new chemotherapeutic agents.

The present invention provides a targeted screen for genes the expression of which
may be altered in response to carbon source. The invention makes use of the
differences in expression profiles between normal and diseased tissue as a
consequence of differences in metabolic state between cancer cells and normal cells
due in part to carbon source utilisation by these respective cell types. The epithelial
cells which line the colon and rectum metabolise butyrate as a carbon source for
energy transduction via glycolysis. The main carbon source utilised by tumour cells
is glucose. Consequently, expression profiles between these cell types are different
due to the differences in carbon source metabolism.

We have identified a large number of potential markers of colorectal cancer which
have utility with respect to the early diagnosis of disease and as targets for the
development of novel chemotherapeutic agents. Moreover, this assay has broader
applicability to conditions resulting from dysfunction of the bowel (e.g. colitis,
ulcerative colitis, diversion colitis, Crohn's disease and irritable bowel syndrome). In
addition the assay provides a screening tool for fibre consumption and an assay for
colon microflora functionality (the effectiveness of fermentation of specific fibres).

According to an aspect of the invention there is provided a method to screen for 
nucleic acid molecules which show altered expression in an isolated first cell sample 
comprising comparing the gene expression profiles between said first cell sample
with a second reference cell sample wherein said first cell sample has been grown in
the presence of the carbon source butyrate, or a related carbon source from which
butyrate is derived, either directly or indirectly, and comparing said expression 
profile with the expression profile in said second reference cell sample which has not 
been grown in the presence of butyrate, or said related carbon source. 

According to a further aspect of the invention there is provided a method to screen 
for nucleic acid molecules which show altered expression in an isolated biological 
sample comprising the steps of: 

i) providing

a) a cell growth preparation comprising a first cell sample derived
from at least one region of the colon; cell growth media; and a
carbon source wherein said carbon source is butyrate; and

b) a cell growth preparation comprising a second cell sample derived
from an equivalent region of the colon; cell growth media; and a
carbon source which is not butyrate;

ii) extracting nucleic acid from said first and second cell samples; and

iii) comparing the gene expression profile in said first cell sample with
the gene expression profile in said second cell sample.
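
By way of illustration only, the comparison in step (iii) can be sketched as a log2 fold-change calculation between the butyrate-grown sample and the reference sample. The gene names, expression values, pseudocount and two-fold threshold below are hypothetical and are not taken from the specification; the method itself does not prescribe any particular statistic.

```python
import math


def log2_fold_changes(butyrate, reference, pseudocount=1.0):
    """Return log2((butyrate + p) / (reference + p)) for genes measured in both samples.

    The pseudocount p is an assumption made here to avoid division by zero.
    """
    shared = butyrate.keys() & reference.keys()
    return {
        gene: math.log2((butyrate[gene] + pseudocount) / (reference[gene] + pseudocount))
        for gene in shared
    }


def altered_genes(butyrate, reference, threshold=1.0):
    """Keep genes whose absolute log2 fold change meets the (assumed) threshold."""
    changes = log2_fold_changes(butyrate, reference)
    return {gene: lfc for gene, lfc in changes.items() if abs(lfc) >= threshold}


# Hypothetical expression values in arbitrary units; gene names are placeholders.
butyrate_sample = {"geneA": 7.0, "geneB": 15.0, "geneC": 1.0}
glucose_sample = {"geneA": 1.0, "geneB": 15.0, "geneC": 7.0}

hits = altered_genes(butyrate_sample, glucose_sample)
for gene in sorted(hits):
    print(gene, hits[gene])
```

A positive value indicates higher expression in the butyrate-grown sample, and a negative value indicates higher expression in the reference sample.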

In a preferred method of the invention said first and second cell samples are
derived from the ascending colon.

In an alternative preferred method of the invention said first and second cell
samples are derived from the transverse colon.

In a further preferred method of the invention said first and second samples
are derived from the descending colon.

In a still further preferred method of the invention said first and second
samples are derived from the sigmoid region of the colon. Preferably said
cell samples are derived from the rectal region of the colon.

In a further preferred method of the invention said first and second cell
samples comprise epithelial cells. 

In a preferred method of the invention said carbon source which is not 
butyrate is glucose. 

In a still further preferred method of the invention said nucleic acid molecule 
which shows altered expression is selected from the group as represented by 
the nucleic acid sequences shown in Table 1, or nucleic acid molecules 
which hybridise to the sequences presented in Table 1. Preferably said nucleic
acid molecules hybridise under stringent hybridisation conditions. 

According to a further aspect of the invention there is provided a method for 
the detection of at least one nucleic acid molecule associated with the 
initiation and/or progression of colorectal cancer, in an animal, comprising 
the steps of: 

i) providing a biological sample comprising at least one cell to 
be tested; 

ii) contacting said sample with a ligand which binds at least one 
nucleic acid molecule as represented by the nucleic acid 
sequence selected from the group consisting of: 

a) a nucleic acid molecule as represented by the nucleic 
acid sequence as shown in Table 1 ; 

b) a nucleic acid molecule which hybridises to nucleic 
acid molecules as defined in (a); 

c) a nucleic acid molecule that is degenerate as a 
consequence of the genetic code to the nucleic acid 
molecule represented in (a) and (b); 

iii) detecting the presence of at least one nucleic acid molecule in 
said sample. 
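
As an illustrative sketch of what is meant in (c) by a sequence that is degenerate as a consequence of the genetic code, the following checks whether two different DNA sequences encode the same amino acid sequence under the standard genetic code. The function names and the example sequences are hypothetical and do not correspond to any sequence in Table 1.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA coding sequence (length divisible by 3) into one-letter amino acids."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def is_degenerate_variant(seq_a, seq_b):
    """True when the sequences differ at the nucleotide level but encode the same polypeptide."""
    return seq_a.upper() != seq_b.upper() and translate(seq_a) == translate(seq_b)


print(is_degenerate_variant("ATGGCT", "ATGGCC"))  # GCT and GCC both encode alanine
print(is_degenerate_variant("ATGGCT", "ATGGAT"))  # GAT encodes aspartic acid
```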



In a preferred method of the invention said animal is human. 



In a further preferred method of the invention said colorectal cancer is 
adenocarcinoma. 



In a preferred method of the invention said ligand is a nucleic acid molecule adapted 
to anneal to said nucleic acid molecule which is indicative of colorectal cancer. 

It will be apparent to the skilled person that a number of nucleic acid based assay
systems are available which can be adapted to detect nucleic acid molecules as
herein disclosed. Examples include quantitative polymerase chain reaction assays,
in situ hybridisation and northern blot.

According to a further aspect of the invention there is provided a method for the 
detection of at least one polypeptide associated with the initiation and/or progression 
of colorectal cancer, in an animal, comprising the steps of: 

i) providing a biological sample comprising at least one cell to be tested; 

ii) contacting said sample with at least one ligand which ligand 
specifically binds at least one polypeptide encoded by a nucleic acid 
molecule as represented by the nucleic acid sequence shown in Table 
1, or a variant polypeptide comprising an amino acid sequence which 
varies by the addition, deletion or substitution of at least one amino 
acid residue; and 

iii) detecting the presence of at least one polypeptide in said sample.

In a preferred method of the invention said animal is human.

In a further preferred embodiment of the invention said ligand is an antibody, 
preferably a monoclonal antibody, or at least the effective binding part thereof. 




Methods which utilise antibodies to detect the presence of a polypeptide in a 
biological sample are well known in the art and include ELISAs, western blot and
immunofluorescence.

According to a further aspect of the invention there is provided the use of at least one
polypeptide, or variant sequence thereof, encoded by a nucleic acid molecule(s) as 
represented by the nucleic acid sequences as shown in Table 1, as a target for the 
screening of agents which modulate the activity of said polypeptide. 

According to a yet further aspect of the invention there is provided a method to
screen for agents which modulate the activity of at least one gene associated with the 
initiation and/or progression of colorectal cancer comprising the steps of: 

i) forming a preparation comprising at least one polypeptide wherein
said polypeptide is encoded by a nucleic acid molecule as represented
by the nucleic acid sequence as shown in Table 1, or a variant
polypeptide comprising an amino acid sequence which varies by the
addition, deletion or substitution of at least one amino acid residue as
represented by the amino acid sequences shown in Table 1, and at
least one agent to be tested; and

ii) determining the activity of said agent with respect to activity of said
polypeptide.

In a preferred method of the invention said polypeptide is expressed by a cell wherein 
said cell is transformed or transfected with said nucleic acid molecule. Preferably 
said nucleic acid molecule is part of a vector adapted for recombinant expression of
said nucleic acid molecule. Preferably said vector is provided with a promoter which 
enables the expression of said nucleic acid molecule to be regulated. 

In a preferred method of the invention said cell is derived from the colon, preferably 
said cell is an epithelial cell which lines said colon.

In a further preferred method of the invention said agent is an antibody, preferably a
monoclonal antibody or modified antibody, or at least the effective binding part 
thereof. 

Antibodies, also known as immunoglobulins, are protein molecules which usually
have specificity for foreign molecules (antigens). Immunoglobulins (Ig) are a class
of structurally related proteins consisting of two pairs of polypeptide chains, one pair
of light (L) (low molecular weight) chains (κ or λ), and one pair of heavy (H) chains
(γ, α, μ, δ and ε), all four linked together by disulphide bonds. Both H and L chains
have regions that contribute to the binding of antigen and that are highly variable
from one Ig molecule to another. In addition, H and L chains contain regions that are
non-variable or constant.



The L chains consist of two domains. The carboxy-terminal domain is essentially 
identical among L chains of a given type and is referred to as the "constant" (C)
region. The amino terminal domain varies from L chain to L chain and contributes to 
the binding site of the antibody. Because of its variability, it is referred to as the 
"variable" (V) region. 

The H chains of Ig molecules are of several classes, α, μ, δ, ε and γ (of which there
are several sub-classes). An assembled Ig molecule, consisting of one or more units
of two identical H and L chains, derives its name from the H chain that it possesses.
Thus, there are five Ig isotypes: IgA, IgM, IgD, IgE and IgG (with four sub-classes
based on the differences in the "constant" regions of the H chains, i.e., IgG1, IgG2,
IgG3 and IgG4). Further detail regarding antibody structure and their various
functions can be found in Using Antibodies: A Laboratory Manual, Cold Spring
Harbor Laboratory Press.



In a preferred method of the invention said fragment is a Fab fragment.

In a further preferred method of the invention said antibody is selected from the
group consisting of: F(ab')2, Fab, Fv and Fd fragments; and antibodies comprising
CDR3 regions.

Preferably said fragments are single chain antibody variable regions (scFvs) or
domain antibodies. If a hybridoma exists for a specific monoclonal antibody it is well
within the knowledge of the skilled person to isolate scFvs from mRNA extracted
from said hybridoma via RT-PCR. Alternatively, phage display screening can be
undertaken to identify clones expressing scFvs. Domain antibodies are the smallest
binding part of an antibody (approximately 13kDa). Examples of this technology are
disclosed in US6,248,516, US6,291,158, US6,127,197 and EP0368684, which are
all incorporated by reference in their entirety.

A modified antibody, or variant antibody, and a reference antibody may differ in amino
acid sequence by one or more substitutions, additions, deletions or truncations, which
may be present in any combination. Among preferred variants are those that vary
from a reference polypeptide by conservative amino acid substitutions. Such
substitutions are those that substitute a given amino acid by another amino acid of
like characteristics. The following non-limiting groups of amino acids are considered
conservative replacements (similar): a) alanine, serine, and threonine; b) glutamic
acid and aspartic acid; c) asparagine and glutamine; d) arginine and lysine; e)
isoleucine, leucine, methionine and valine; and f) phenylalanine, tyrosine and
tryptophan. Most highly preferred are variants which show enhanced biological
activity.
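
The conservative replacement groups (a) to (f) above can be expressed as a simple lookup. The following is a minimal sketch, assuming one-letter amino acid codes and variants of equal length (substitutions only); the function names and the example sequences are hypothetical.

```python
# Groups (a) to (f) from the text, written with one-letter amino acid codes.
CONSERVATIVE_GROUPS = [
    {"A", "S", "T"},        # a) alanine, serine, threonine
    {"E", "D"},             # b) glutamic acid, aspartic acid
    {"N", "Q"},             # c) asparagine, glutamine
    {"R", "K"},             # d) arginine, lysine
    {"I", "L", "M", "V"},   # e) isoleucine, leucine, methionine, valine
    {"F", "Y", "W"},        # f) phenylalanine, tyrosine, tryptophan
]


def is_conservative(original, replacement):
    """True when two different residues fall within the same listed group."""
    if original == replacement:
        return False  # identical residues are not a substitution
    return any(original in group and replacement in group for group in CONSERVATIVE_GROUPS)


def classify_substitutions(reference, variant):
    """List (position, reference residue, variant residue, conservative?) for each difference.

    Positions are 1-based. Only equal-length sequences are handled in this sketch.
    """
    if len(reference) != len(variant):
        raise ValueError("this sketch only handles substitutions, not additions or deletions")
    return [
        (index + 1, ref, var, is_conservative(ref, var))
        for index, (ref, var) in enumerate(zip(reference, variant))
        if ref != var
    ]


# Hypothetical six-residue sequences used purely for illustration.
print(classify_substitutions("MKTAYL", "MRTSYG"))
```

Because the list in the text is stated to be non-limiting, a substitution reported as non-conservative here is only non-conservative with respect to these six groups.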

Preferably said antibody is a humanised or chimeric antibody. 

A chimeric antibody is produced by recombinant methods to contain the variable 
region of an antibody with an invariant or constant region of a human antibody. 

A humanised antibody is produced by recombinant methods to combine the
complementarity determining regions (CDRs) of an antibody with both the constant
(C) regions and the framework regions from the variable (V) regions of a human 
antibody. 

Chimeric antibodies are recombinant antibodies in which all of the V-regions of a
mouse or rat antibody are combined with human antibody C-regions. Humanised
antibodies are recombinant hybrid antibodies which fuse the complementarity
determining regions from a rodent antibody V-region with the framework regions
from the human antibody V-regions. The C-regions from the human antibody are also
used. The complementarity determining regions (CDRs) are the regions within the
N-terminal domain of both the heavy and light chain of the antibody to which the
majority of the variation of the V-region is restricted. These regions form loops at the
surface of the antibody molecule. These loops provide the binding surface between
the antibody and antigen.

Antibodies from non-human animals provoke an immune response against the foreign
antibody, leading to its removal from the circulation. Both chimeric and humanised
antibodies have reduced antigenicity when injected into a human subject because there
is a reduced amount of rodent (i.e. foreign) antibody within the recombinant hybrid
antibody, while the human antibody regions do not elicit an immune response. This
results in a weaker immune response and a decrease in the clearance of the antibody.
This is clearly desirable when using therapeutic antibodies in the treatment of human
diseases. Humanised antibodies are designed to have fewer "foreign" antibody regions
and are therefore thought to be less immunogenic than chimeric antibodies.

In an alternative preferred method of the invention said agent is a polypeptide or a 
peptide. Preferably said polypeptide or peptide is modified. 

In a preferred method of the invention said peptide is at least 6 amino acid residues in
length. Preferably the length of said peptide/polypeptide is selected from the group
consisting of: at least 7 amino acid residues; 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18,
19, or 20 amino acid residues in length. Alternatively the length of said
peptide/polypeptide is at least 20 amino acid residues; 30; 40; 50; 60; 70; 80; 90; or
100 amino acid residues in length.

It will be apparent to one skilled in the art that modification to the amino acid
sequence of peptide agents could enhance the binding and/or stability of the peptide
with respect to its target sequence. In addition, modification of the peptide may also
increase the in vivo stability of the peptide thereby reducing the effective amount of
peptide necessary to inhibit the activity of a target polypeptide. This would
advantageously reduce undesirable side effects which may result in vivo.
Alternatively or preferably, said modification includes the use of modified amino
acids in the production of recombinant or synthetic forms of peptides. It will be
apparent to one skilled in the art that modified amino acids include, by way of
example and not by way of limitation, 4-hydroxyproline, 5-hydroxylysine,
N6-acetyllysine, N6-methyllysine, N6,N6-dimethyllysine, N6,N6,N6-trimethyllysine,
cyclohexylalanine, D-amino acids and ornithine. Other modifications include amino
acids with a C2, C3 or C4 alkyl R group optionally substituted by 1, 2 or 3 substituents
selected from halo (e.g. F, Br, I), hydroxy or C1-C4 alkoxy. Modifications also
include, by way of example and not by way of limitation, acetylation and amidation.

In a preferred embodiment of the invention said peptide sequence is acetylated. 
Preferably said acetylation is to the amino terminus of said peptide. 

In a further preferred embodiment of the invention said peptide sequence is amidated.
Preferably said amidation is to the carboxyl-terminus of said peptide. 

It will also be apparent to one skilled in the art that peptides could be modified by
cyclisation. Cyclisation is known in the art (see Scott et al Chem Biol (2001),
8: 801-815; Gellerman et al J. Peptide Res (2001), 57: 277-291; Dutta et al J. Peptide
Res (2000), 8: 398-412; Ngoka and Gross J Amer Soc Mass Spec (1999), 10: 360-363).



In a further preferred method of the invention said agent is a nucleic acid molecule.
Preferably said nucleic acid molecule is an aptamer or a modified aptamer. In an 
alternative preferred method of the invention said nucleic acid is an inhibitory RNA 
(RNAi) molecule. Alternatively said nucleic acid molecule is an antisense nucleic 
acid molecule. 



Nucleic acids have both linear sequence structure and a three dimensional structure
which in part is determined by the linear sequence and also the environment in which
these molecules are located. Conventional therapeutic molecules are small
molecules, for example, peptides, polypeptides, or antibodies, which bind target
molecules to produce an agonistic or antagonistic effect. It has become apparent that
nucleic acid molecules also have potential with respect to providing agents with the
requisite binding properties which may have therapeutic utility. These nucleic acid
molecules are typically referred to as aptamers. Aptamers are small, usually
stabilised, nucleic acid molecules which comprise a binding domain for a target
molecule. A screening method to identify aptamers is described in US 5,270,163,
which is incorporated by reference. Aptamers are typically oligonucleotides which
may be single stranded oligodeoxynucleotides, oligoribonucleotides, or modified
oligodeoxynucleotides or oligoribonucleotides.



The term "modified" encompasses nucleotides with a covalently modified base
and/or sugar. For example, modified nucleotides include nucleotides having sugars
which are covalently attached to low molecular weight organic groups other than a
hydroxyl group at the 3' position and other than a phosphate group at the 5' position.
Thus modified nucleotides may also include 2' substituted sugars such as
2'-O-methyl-; 2'-O-alkyl; 2'-O-allyl; 2'-S-alkyl; 2'-S-allyl; 2'-fluoro-; 2'-halo or
2'-azido-ribose; carbocyclic sugar analogues; α-anomeric sugars; epimeric sugars such
as arabinose, xyloses or lyxoses; pyranose sugars; furanose sugars; and sedoheptulose.

Modified nucleotides are known in the art and include, by way of example and not by
way of limitation, alkylated purines and/or pyrimidines; acylated purines and/or
pyrimidines; or other heterocycles. These classes of pyrimidines and purines are
known in the art and include pseudoisocytosine; N4,N4-ethanocytosine;
8-hydroxy-N6-methyladenine; 4-acetylcytosine; 5-(carboxyhydroxylmethyl)uracil;
5-fluorouracil; 5-bromouracil; 5-carboxymethylaminomethyl-2-thiouracil;
5-carboxymethylaminomethyl uracil; dihydrouracil; inosine; N6-isopentyl-adenine;
1-methyladenine; 1-methylpseudouracil; 1-methylguanine; 2,2-dimethylguanine;
2-methyladenine; 2-methylguanine; 3-methylcytosine; 5-methylcytosine;
N6-methyladenine; 7-methylguanine; 5-methylaminomethyl uracil;
5-methoxyaminomethyl-2-thiouracil; β-D-mannosylqueosine;
5-methoxycarbonylmethyluracil; 5-methoxyuracil; 2-methylthio-N6-isopentenyladenine;
uracil-5-oxyacetic acid methyl ester; pseudouracil; 2-thiocytosine;
5-methyl-2-thiouracil; 2-thiouracil; 4-thiouracil; 5-methyluracil;
N-uracil-5-oxyacetic acid methylester; uracil-5-oxyacetic acid; queosine;
2-thiocytosine; 5-propyluracil; 5-propylcytosine; 5-ethyluracil; 5-ethylcytosine;
5-butyluracil; 5-pentyluracil; 5-pentylcytosine; 2,6-diaminopurine;
methylpseudouracil; 1-methylguanine; and 1-methylcytosine.

The aptamers of the invention are synthesized using conventional phosphodiester
linked nucleotides and synthesized using standard solid or solution phase synthesis
techniques which are known in the art. Linkages between nucleotides may use
alternative linking molecules. For example, linking groups of the formula P(O)S,
(thioate); P(S)S, (dithioate); P(O)NR'2; P(O)R'; P(O)OR6; CO; or CONR'2 wherein
R is H (or a salt) or alkyl (1-12C) and R6 is alkyl (1-9C) is joined to adjacent
nucleotides through -O- or -S-. The binding of aptamers to a target polypeptide is
readily testable.

An alternative nucleic acid molecule is a so called RNAi molecule. A recent
technique to specifically ablate gene function is through the introduction of double
stranded RNA, also referred to as inhibitory RNA (RNAi), into a cell which results
in the destruction of mRNA complementary to the sequence included in the RNAi
molecule. The RNAi molecule comprises two complementary strands of RNA (a
sense strand and an antisense strand) annealed to each other to form a double
stranded RNA molecule. The RNAi molecule is typically derived from exonic or
coding sequence of the gene which is to be ablated. Recent studies suggest that
RNAi molecules ranging from 100-1000bp derived from coding sequence are
effective inhibitors of gene expression. Surprisingly, only a few molecules of RNAi
are required to block gene expression which implies the mechanism is catalytic. The
site of action appears to be nuclear as little if any RNAi is detectable in the cytoplasm
of cells indicating that RNAi exerts its effect during mRNA synthesis or processing.

In a preferred method of the invention there is provided a cassette comprising a
nucleic acid molecule, or part thereof, wherein said molecule is selected from the
group consisting of:

i) a nucleic acid molecule represented by the nucleic acid sequence
shown in Table 1;

ii) a nucleic acid molecule which hybridises to the sequence in (i) above
and which encodes a polypeptide which initiates or promotes
transformation of colon cells; or

iii) a nucleic acid molecule which is degenerate because of the genetic
code to the sequences defined in (i) and (ii) above,

wherein said cassette is adapted such that both sense and antisense nucleic acid
molecules are transcribed from said cassette.



In a preferred method of the invention said cassette is provided with at least two 
promoters adapted to transcribe both sense and antisense strands of said nucleic acid 
molecule. 

In a further preferred method of the invention said cassette comprises a nucleic acid 
molecule wherein said molecule comprises a first part linked to a second part 
wherein said first and second parts are complementary over at least part of their 
sequence and further wherein transcription of said nucleic acid molecule produces an
RNA molecule which forms a double stranded region by complementary base pairing
of said first and second parts. 

In a preferred embodiment of the invention said first and second parts are linked by at
least one nucleotide base. 

In a preferred embodiment of the invention said first and second parts are linked by 
2, 3, 4, 5, 6, 7, 8, 9 or at least 10 nucleotide bases. 
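
The arrangement of a first part joined to a complementary second part through a short linker can be illustrated at the DNA level as follows. This is a sketch only: the five-base first part and the nine-base linker are hypothetical and chosen for brevity, and the function names are not taken from the specification.

```python
# Translation table mapping each DNA base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def hairpin_insert(first_part, linker):
    """Join a first part to its reverse complement (the second part) through a linker.

    When transcribed, the two parts can base pair with each other to form a
    double stranded stem, with the linker forming the loop.
    """
    if len(linker) < 1:
        raise ValueError("the parts must be linked by at least one nucleotide base")
    return first_part.upper() + linker.upper() + reverse_complement(first_part)


# Hypothetical sequences used purely for illustration.
insert = hairpin_insert("ATGGC", "TTCAAGAGA")
print(insert)
```

In practice the first part would be far longer than five bases; the short sequence is used here only so that the pairing of the two parts is easy to check by eye.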

In a further preferred embodiment of the invention the length of the RNAi molecule 
is between 100bp-1000bp. More preferably still the length of RNAi is selected from
100bp; 200bp; 300bp; 400bp; 500bp; 600bp; 700bp; 800bp; 900bp; or 1000bp. More
preferably still said RNAi is at least 1000bp.

In an alternative preferred method of the invention the RNAi molecule is between 
15bp and 25bp, preferably said molecule is 21bp. Preferably said cassette is part of a 
vector. 
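
For the shorter RNAi molecules of between 15bp and 25bp, candidate target regions can be enumerated by sliding a window along the coding sequence. The sketch below assumes a simple exhaustive enumeration with no further selection criteria; the function name and the example sequence are hypothetical.

```python
def candidate_windows(coding_sequence, length=21):
    """Return every contiguous window of the given length from a coding sequence.

    The length is restricted to the 15 to 25 base range stated in the text,
    with 21 as the preferred default.
    """
    if not 15 <= length <= 25:
        raise ValueError("length must be between 15 and 25 bases")
    sequence = coding_sequence.upper()
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]


# Hypothetical 25-base sequence used purely for illustration.
example = "ATGGCTAGCTAGGATCCGATCGTAA"
windows = candidate_windows(example)
print(len(windows), windows[0])
```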



According to a further aspect of the invention there is provided an antibody identified
by the method according to the invention for use as a pharmaceutical. 

According to a further aspect of the invention there is provided a polypeptide or 
peptide identified by the method according to the invention for use as a 
pharmaceutical.

According to a further aspect of the invention there is provided a nucleic acid 
molecule identified by the method according to the invention for use as a 
pharmaceutical. 

In a preferred embodiment of the invention said nucleic acid molecule is an aptamer.

In an alternative preferred embodiment of the invention said nucleic acid molecule is
an inhibitory RNA.

In a further alternative preferred embodiment of the invention said nucleic acid
molecule is an antisense nucleic acid molecule. 

In a preferred embodiment of the invention said pharmaceutical further comprises a
diluent, carrier or excipient.

When administered, the therapeutic compositions of the present invention are
administered in pharmaceutically acceptable preparations. Such preparations may
routinely contain pharmaceutically acceptable concentrations of salt, buffering
agents, preservatives, compatible carriers, supplementary immune potentiating agents
such as adjuvants and cytokines and optionally other therapeutic agents, such as
chemotherapeutic agents.

The therapeutics of the invention can be administered by any conventional route,
including injection or by gradual infusion over time. The administration may, for
example, be oral, intravenous, intraperitoneal, intramuscular, intracavity,
subcutaneous, or transdermal. When antibodies are used therapeutically, a preferred
route of administration is by pulmonary aerosol. Techniques for preparing aerosol
delivery systems containing antibodies are well known to those of skill in the art.
Generally, such systems should utilize components which will not significantly
impair the biological properties of the antibodies, such as the paratope binding
capacity (see, for example, Sciarra and Cutie, "Aerosols," in Remington's
Pharmaceutical Sciences, 18th edition, 1990, pp 1694-1712; incorporated by
reference). Those of skill in the art can readily determine the various parameters and
conditions for producing antibody aerosols without resort to undue experimentation.
When using antisense preparations of the invention, slow intravenous administration
is preferred.

The compositions of the invention are administered in effective amounts. An
"effective amount" is that amount of a composition that alone, or together with
further doses, produces the desired response. In the case of treating a particular
disease, such as cancer, the desired response is inhibiting the progression of the
disease. This may involve only slowing the progression of the disease temporarily,
although more preferably, it involves halting the progression of the disease
permanently. This can be monitored by routine methods or can be monitored
according to diagnostic methods of the invention discussed herein.

Such amounts will depend, of course, on the particular condition being treated, the 
severity of the condition, the individual patient parameters including age, physical 
condition, size and weight, the duration of the treatment, the nature of concurrent 
therapy (if any), the specific route of administration and like factors within the 
knowledge and expertise of the health practitioner. These factors are well known to 
those of ordinary skill in the art and can be addressed with no more than routine 
experimentation. It is generally preferred that a maximum dose of the individual 
components or combinations thereof be used, that is, the highest safe dose according 
to sound medical judgment. It will be understood by those of ordinary skill in the art, 
however, that a patient may insist upon a lower dose or tolerable dose for medical 
reasons, psychological reasons or for virtually any other reasons. 

The pharmaceutical compositions used in the foregoing methods preferably are 
sterile and contain an effective amount for producing the desired response in a unit of 
weight or volume suitable for administration to a patient. The response can, for 
example, be determined by measuring the physiological effects of the composition, 
such as regression of a tumour, decrease of disease symptoms, modulation of 
apoptosis, etc. 

The doses of pharmaceutical agent administered to a subject can be chosen in 
accordance with different parameters, in particular in accordance with the mode of 
administration used and the state of the subject. Other factors include the desired 
period of treatment. In the event that a response in a subject is insufficient at the 
initial doses applied, higher doses (or effectively higher doses by a different, more 
localized delivery route) may be employed to the extent that patient tolerance 
permits. 



In general, doses of pharmaceutical are formulated and administered in doses 
between 1 ng and about 500 mg, and between 10 ng and 100 mg, according to any 
standard procedure in the art. Where nucleic acids are employed, doses of between 
1 ng and 0.1 mg generally will be formulated and administered according to standard 
procedures. Other protocols for the administration of compositions will be known to 
one of ordinary skill in the art, in which the dose amount, schedule of injections, sites 
of injections, mode of administration (e.g., intra-tumoral) and the like vary from the 
foregoing. Administration of pharmaceutical compositions to mammals other than 
humans, e.g. for testing purposes or veterinary therapeutic purposes, is carried out 
under substantially the same conditions as described above. A subject, as used 
herein, is a mammal, preferably a human, and including a non-human primate, cow, 
horse, pig, sheep, goat, dog, cat or rodent. 

When administered, the pharmaceutical preparations of the invention are applied in 
pharmaceutically-acceptable amounts and in pharmaceutically-acceptable 
compositions. The term "pharmaceutically acceptable" means a non-toxic material 
that does not interfere with the effectiveness of the biological activity of the active 
ingredients. Such preparations may routinely contain salts, buffering agents, 
preservatives, compatible carriers, and optionally other therapeutic agents. When 
used in medicine, the salts should be pharmaceutically acceptable, but 
non-pharmaceutically acceptable salts may conveniently be used to prepare 
pharmaceutically-acceptable salts thereof and are not excluded from the scope of the 
invention. Such pharmacologically and pharmaceutically-acceptable salts include, 
but are not limited to, those prepared from the following acids: hydrochloric, 
hydrobromic, sulfuric, nitric, phosphoric, maleic, acetic, salicylic, citric, formic, 
malonic, succinic, and the like. Also, pharmaceutically-acceptable salts can be 
prepared as alkaline metal or alkaline earth salts, such as sodium, potassium or 
calcium salts. 

Pharmaceutical compositions may be combined, if desired, with a pharmaceutically- 
acceptable carrier. The term "pharmaceutically-acceptable carrier" as used herein 
means one or more compatible solid or liquid fillers, diluents or encapsulating 
substances which are suitable for administration into a human. The term "carrier" 
denotes an organic or inorganic ingredient, natural or synthetic, with which the active 
ingredient is combined to facilitate the application. The components of the 
pharmaceutical compositions also are capable of being co-mingled with the 
molecules of the present invention, and with each other, in a manner such that there 
is no interaction which would substantially impair the desired pharmaceutical 
efficacy. 

The pharmaceutical compositions may contain suitable buffering agents, including: 
acetic acid in a salt; citric acid in a salt; boric acid in a salt; and phosphoric acid in a 
salt. 

The pharmaceutical compositions also may contain, optionally, suitable 
preservatives, such as: benzalkonium chloride; chlorobutanol; parabens and 
thimerosal. 

The pharmaceutical compositions may conveniently be presented in unit dosage form 
and may be prepared by any of the methods well-known in the art of pharmacy. All 
methods include the step of bringing the active agent into association with a carrier 
which constitutes one or more accessory ingredients. In general, the compositions 
are prepared by uniformly and intimately bringing the active compound into 
association with a liquid carrier, a finely divided solid carrier, or both, and then, if 
necessary, shaping the product. 

Compositions suitable for oral administration may be presented as discrete units, 
such as capsules, tablets, lozenges, each containing a predetermined amount of the 
active compound. Other compositions include suspensions in aqueous liquids or 
non-aqueous liquids such as a syrup, elixir or an emulsion. 

Compositions suitable for parenteral administration conveniently comprise a sterile 
aqueous or non-aqueous preparation of pharmaceutical agents, which is preferably 
isotonic with the blood of the recipient. This preparation may be formulated 
according to known methods using suitable dispersing or wetting agents and 
suspending agents. The sterile injectable preparation also may be a sterile injectable 
solution or suspension in a non-toxic parenterally-acceptable diluent or solvent, for 
example, as a solution in 1,3-butane diol. Among the acceptable vehicles and 
solvents that may be employed are water, Ringer's solution, and isotonic sodium 
chloride solution. In addition, sterile, fixed oils are conventionally employed as a 
solvent or suspending medium. For this purpose any bland fixed oil may be 
employed including synthetic mono- or di-glycerides. In addition, fatty acids such as 
oleic acid may be used in the preparation of injectables. Carrier formulation suitable 
for oral, subcutaneous, intravenous, intramuscular, etc. administrations can be found 
in Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, PA. 


An embodiment of the invention will now be described by example only and with 
reference to the following Figures and Tables; 

Figure 1 illustrates a concentration-response of cells growing in butyrate as sole 
carbon source. This is the summary of four independent repeat experiments. Legend 
shows butyrate concentrations in mM; 

Figure 2 illustrates the purity and quality of RNA preparation. The 28S and 18S 
sample bands are tight and clearly resolved for RNA prepared from butyrate- and 
glucose-grown cells. Little or no DNA or salt contamination appears in the samples; 

Table 1 illustrates nucleic acid and protein sequences identified by the screening 
method according to the invention; and 

Table 2 illustrates a summary of expression data of nucleic acid sequences identified 
in Table 1. 

Materials and Methods 

We have compared the expression profiles of colon cells growing in either glucose or 
butyrate as a carbon source. HT29 colon carcinoma cells were cultured in DMEM 
medium (Gibco) in the presence of 10% foetal calf serum, penicillin and 
streptomycin. Cells were either cultured in glucose alone as the sole carbon source, or 
in butyrate as the sole extraneously provided carbon source. Empirical analysis of 
HT29 cells grown in multiple butyrate concentrations revealed that 2 mM butyrate 
was optimal for cell culture in the absence of glucose. Cells were cultured in either 
medium for multiple passages (typically 4). RNA was extracted from cells grown in 
each condition and used to probe an Affymetrix human 12k array. The expression 
profile of cells cultured in each condition was compared and genes altered in 
expression by more than 2 fold are listed in Table 2. 
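
By way of illustration only, the selection of genes altered in expression by more than 2 fold can be sketched as follows. The sketch assumes a single positive signal value per gene and per growth condition; the gene names, signal values and function names are hypothetical and are not taken from the array data.

```python
def fold_change(butyrate_signal, glucose_signal):
    """Signed fold change of the butyrate signal relative to glucose.

    Positive values mean higher expression in butyrate, negative values
    mean higher expression in glucose (e.g. -4.0 is a four-fold decrease).
    """
    if butyrate_signal <= 0 or glucose_signal <= 0:
        raise ValueError("signals must be positive")
    ratio = butyrate_signal / glucose_signal
    return ratio if ratio >= 1 else -1 / ratio


def altered_genes(signals, threshold=2.0):
    """Return genes whose absolute fold change exceeds the threshold."""
    selected = {}
    for gene, (butyrate_signal, glucose_signal) in signals.items():
        change = fold_change(butyrate_signal, glucose_signal)
        if abs(change) > threshold:
            selected[gene] = change
    return selected


# Hypothetical signals: (butyrate-grown, glucose-grown)
example = {
    "GENE_A": (400.0, 100.0),
    "GENE_B": (120.0, 100.0),
    "GENE_C": (50.0, 200.0),
}
print(altered_genes(example))
```

A signed, symmetric fold change is used in this sketch so that increases and decreases of equal magnitude are treated alike by the threshold.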



Materials used during this study 



ITEM                          ITEM - SPECIFICS                              SUPPLIER

Glucose medium (1)            Dulbecco's Modified Eagle Medium 25 mM        GIBCO
                              HEPES 1 x 0.1 micron filtered with sodium
                              pyruvate, with 1000 mg/l glucose with
                              pyridoxine + FCS + p/s (500 ml)

Butyrate medium (2)           Dulbecco's Modified Eagle Medium 1 x 0.1      GIBCO
0.2 mM NaB medium             micron filtered with L-glutamine without
                              glucose, without sodium pyruvate + NaB (1M)
                              110 µl + FCS + p/s (555.1 ml)

Butyrate medium (3)           Dulbecco's Modified Eagle Medium 1 x 0.1      GIBCO
2 mM NaB medium               micron filtered with L-glutamine without
                              glucose, without sodium pyruvate + NaB (1M)
                              1100 µl + FCS + p/s (556.1 ml)

Medium without glucose and    Dulbecco's Modified Eagle Medium 1 x 0.1      GIBCO
without butyrate (4)          micron filtered with L-glutamine without
                              glucose, without sodium pyruvate + FCS +
                              p/s (550 ml)

NaB stock                     Sodium Butyrate powder dissolved in sterile   Sigma
                              water 250 mg in 2.27 ml water (1M) 0.2 µm
                              filter sterilised

Sterile syringes              5 ml                                          Becton Dickinson UK, Ltd

Sterilising filters           0.2 µm Acrodisc                               Gelman Sciences, Ltd

FCS                           Foetal Calf Serum 50 ml per 500 ml DMEM       Harlan Sera Lab

P/S                           Penicillin - Streptomycin solution 100 ml     Sigma
                              bottle (100 X) - 5 ml per 500 ml DMEM

TE for splitting cells        Trypsin Enzyme - 100 ml bottle - 3 ml per     Sigma
                              T75 and 1 ml per 6 well plate well

FCS tubes                     50 ml Centrifuge tubes                        Corning Inc

P/S + TE tubes                30 ml Universal containers                    Bibby Sterilin Ltd

Tissue Culture Plates         6 well sterile with lid single packed         Greiner bio-one

Tissue Culture Flasks         T75                                           Nunclon

Stripette ® 5 ml, 10 ml,      Serological Pipette, individually wrapped     Corning Inc / Costar
25 ml

Pipette                       Powerpette plus                               Jencons

Cell Counting Slide           Haemocytometer, improved Neubauer             Neubauer

Ethanol for tissue culture    70 % EtOH                                     Sigma

Virkon for cell culture       1 % Virkon                                    Day Impex, Ltd

[...] work                    [...]                                         CK Olympus, Tokyo

Paper towels                  Blue                                          Jamont (UK), Ltd

[...] examination gloves      Large                                         Shermond Surgical Supply, Ltd

RNA extraction reagent        TRIzol® Reagent                               Invitrogen - Life Technologies

RNA extraction reagent        Chloroform                                    Sigma

RNA extraction reagent        Isopropyl alcohol                             Sigma

RNA extraction reagent        75% EtOH in DEPC-treated water                Sigma

RNA extraction reagent        RNase-free water                              Sigma

RNA clean up kit              RNeasy Midi Kit (10 RNeasy midi spin          Qiagen
                              columns)

β-Mercaptoethanol             14.3 M stock solution                         Sigma

Ethanol for Qiagen            96-100% EtOH                                  Sigma

[...]                         1 g in 100 ml TB-EDTA-Buffer                  Helena Biosciences, UK

TB-EDTA-Buffer                Tris-Borate-EDTA buffer 100 ml                Sigma

Eppendorf tubes               1.5 ml                                        Sarstedt Laboratory supplies, Ltd

Loading buffer                6X
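
The concentrations quoted for the NaB stock and the two butyrate media can be checked with a short calculation, sketched below. The molar mass of sodium butyrate (110.09 g/mol) is an assumed reference value not given in the source, and the stock volumes are read as 110 µl and 1100 µl, an interpretation that agrees with the listed total volumes of 555.1 ml and 556.1 ml. The function names are hypothetical.

```python
# Assumed molar mass of sodium butyrate (C4H7NaO2), g/mol; not stated in the source.
NAB_MOLAR_MASS = 110.09


def stock_molarity(mass_mg, volume_ml, molar_mass=NAB_MOLAR_MASS):
    """Molarity (mol/l) of a stock made by dissolving mass_mg in volume_ml."""
    moles = (mass_mg / 1000.0) / molar_mass
    litres = volume_ml / 1000.0
    return moles / litres


def final_concentration_mM(stock_molar, stock_volume_ul, total_volume_ml):
    """Final concentration in mM after adding stock to a medium.

    mol/l x µl gives µmol; µmol per ml of final volume equals mM.
    """
    return stock_molar * stock_volume_ul / total_volume_ml


stock = stock_molarity(250, 2.27)                 # 250 mg in 2.27 ml water
low = final_concentration_mM(1.0, 110, 555.1)     # butyrate medium (2)
high = final_concentration_mM(1.0, 1100, 556.1)   # butyrate medium (3)
print(round(stock, 3), round(low, 3), round(high, 3))
```

Under these assumptions the stock is approximately 1 M and the two media are approximately 0.2 mM and 2 mM NaB, consistent with the labels in the table.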











The Human Colon Carcinoma Cell Line - HT29 



The HT29 cell line was established from a colon adenocarcinoma which was removed 
from a 44 year old Caucasian woman. The cell line is epithelial in origin and 
hypertriploid. It has been shown to be tumourigenic in nude mice and synthesizes 
carcinoembryonic antigen - CEA (Egan & Todd, 1972) and the transforming 
growth factors - TGF-α and TGF-β (Anzano et al. 1989) when maintained in vitro. 
The HT29 cell line constitutively over-produces mutant p53 protein as a consequence 
of a point mutation at codon 273, resulting in an Arginine to Histidine amino acid 
substitution (Hsu et al. 1994). 

The Culture of HT29 Colorectal adenocarcinoma cells 

Cells were cultured in T75 tissue culture flasks (Nunclon) in 5% CO2 at 37°C. Cells 
were passaged when confluent by washing twice in PBS and incubating in 
pre-warmed trypsin : EDTA (1:1) at 37°C until cells detached. The cells were then 
re-suspended in the appropriate growth medium, either glucose DMEM or butyrate 
DMEM before being seeded into new T75 tissue culture flasks or 6-well plates. 

Optimisation of HT29 cell growth in butyrate as sole extraneous carbon source 



HT29 cells were seeded out into 19 wells (in 6 well plates) at a cell density of 0.5 x 
10^6 cells per well (i.e. 500 000 cells per well) deduced with the aid of a 
Haemocytometer (Improved Neubauer). These cells were taken from T75 - 0.2 mM 
butyrate (NaB) DMEM flasks and allowed to adhere to the 6-well plates over 72 hrs 
also in 0.2 mM NaB DMEM with FCS and Penicillin / Streptomycin antibiotics. 
After the cells had adhered to the surface of the 6 well plates the 0.2 mM NaB 
DMEM was removed and each well was washed twice with PBS in order to remove 
all traces of the 0.2 mM NaB DMEM. Different concentrations of NaB DMEM with 
FCS and with Penicillin / Streptomycin antibiotics were then added to the appropriate 
wells in triplicate. Cell counts were taken at various time points. Specific media were 
changed daily in order to maintain the appropriate / desired NaB concentrations per 
well. All solutions / reagents used were pre-warmed in a water bath prior to use so as 
to avoid any cold shock to the cells. 
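
The seeding density above is derived from haemocytometer counts. A minimal sketch of that calculation follows, assuming the standard Improved Neubauer geometry in which one large square (1 mm x 1 mm x 0.1 mm deep) holds 0.1 µl, so that the mean count per large square multiplied by 10^4 gives cells per ml. The counts, dilution factor and function names are hypothetical.

```python
def cells_per_ml(counts_per_large_square, dilution_factor=1.0):
    """Cell concentration from Improved Neubauer counts.

    Each large square holds 0.1 µl, so mean count x 10^4 = cells per ml.
    """
    if not counts_per_large_square:
        raise ValueError("at least one count is required")
    mean_count = sum(counts_per_large_square) / len(counts_per_large_square)
    return mean_count * 1e4 * dilution_factor


def volume_to_seed_ml(target_cells, concentration_per_ml):
    """Volume of suspension (ml) containing the target number of cells."""
    return target_cells / concentration_per_ml


# Hypothetical counts from four large squares of an undiluted suspension
concentration = cells_per_ml([98, 102, 100, 100])
print(concentration)                              # cells per ml
print(volume_to_seed_ml(0.5e6, concentration))    # ml per well for 0.5 x 10^6 cells
```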






RNA extraction using TRIzol® Reagent 



Total RNA was extracted from HT29 cells grown to confluence in T75 flasks using 
TRIzol Reagent as per manufacturer's recommendations. Cells were grown for 
several passages either in butyrate-containing medium, or in glucose-containing 
medium prior to extraction of RNA. 

Cells were homogenised using 1 ml TRIzol Reagent per 10 cm^2 area of culture 
surface. The homogenised samples were incubated for 5 minutes at ambient 
temperature to permit the complete dissociation of nucleoprotein complexes. 200 µl 
of chloroform was added to each sample. Tubes were shaken vigorously by hand for 
15 seconds and incubated at ambient temperature for 3 minutes. Samples were 
centrifuged at 12000 g for 15 minutes at 4°C. RNA in the aqueous phase was 
separated and precipitated using isopropyl alcohol. RNA was rinsed, air dried and 
redissolved in RNase-free water. 
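
Two quantities in the procedure above lend themselves to a quick calculation, sketched here: the TRIzol volume for a given culture area (1 ml per 10 cm^2) and the rotor speed corresponding to 12000 g. The conversion uses the general relation RCF = 1.118 x 10^-5 x r x rpm^2 with r in cm; the nominal 75 cm^2 area of a T75 flask and the 8 cm rotor radius are assumptions for illustration, and the function names are hypothetical.

```python
import math


def trizol_volume_ml(culture_area_cm2):
    """TRIzol volume at 1 ml per 10 cm^2 of culture surface."""
    return culture_area_cm2 / 10.0


def rcf_for_rpm(rpm, rotor_radius_cm):
    """Relative centrifugal force (x g) for a rotor speed and radius."""
    return 1.118e-5 * rotor_radius_cm * rpm ** 2


def rpm_for_rcf(rcf_g, rotor_radius_cm):
    """Rotor speed (rpm) needed to reach a given RCF at a given radius."""
    return math.sqrt(rcf_g / (1.118e-5 * rotor_radius_cm))


print(trizol_volume_ml(75))            # nominal T75 flask area (assumed)
print(round(rpm_for_rcf(12000, 8.0)))  # assumed 8 cm rotor radius
```

The rotor radius must be taken from the centrifuge actually used; the value here is a placeholder.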

RNA was further purified using Qiagen RNeasy columns. The columns were used 
exactly as per manufacturer's recommendations. RNA was eluted into RNase-free 
water. 

RNA purified in this way was analysed by agarose gel to establish purity and quality. 
The gel is shown in Figure 2. 

Microarray analysis 

Microarray analysis was undertaken as a commercial service by the University of 
Newcastle-upon-Tyne. In this study, the 2 RNA samples (1x butyrate + 1x glucose) 
from the 2 experimental conditions (butyrate + glucose) were sent to the Institute for 
Human Genetics at the University of Newcastle-upon-Tyne for microarray analysis. 
This was performed on a 12 k Affymetrix Homo sapiens gene chip. Genes altered in 
expression by more than 2 fold on the microarray are listed in Table 1. 

Claims 



1. A method to screen for nucleic acid molecules which show altered expression 
in an isolated first cell sample comprising comparing the gene expression profiles 
between said first cell sample with a second reference cell sample wherein said first 
cell sample has been grown in the presence of the carbon source butyrate, or a related 
carbon source from which butyrate is derived, either directly or indirectly, and 
comparing said expression profile with the expression profile in said second 
reference cell sample which has not been grown in the presence of butyrate, or said 
related carbon source. 

2. A method according to Claim 1 wherein said screen for nucleic acid 
molecules comprises the steps of: 

i) providing 

a) a cell growth preparation comprising a first cell sample derived 
from at least one region of the colon; cell growth media; and a 
carbon source wherein said carbon source is butyrate; and 

b) a cell growth preparation comprising a second cell sample derived 
from an equivalent region of the colon; cell growth media; and a 
carbon source which is not butyrate; 

ii) extracting nucleic acid from said first and second cell samples; and 

iii) comparing the gene expression profile in said first cell sample with 
the gene expression profile in said second cell sample. 

3. A method according to Claim 1 or 2 wherein said first and second cell 
samples are derived from the ascending colon. 

4. A method according to Claim 1 or 2 wherein said first and second cell 
samples are derived from the transverse colon. 

5. A method according to Claim 1 or 2 wherein said first and second 
samples are derived from the descending colon. 

6. A method according to Claim 1 or 2 wherein said first and second 
samples are derived from the sigmoid region of the colon. 

7. A method according to Claim 6 wherein said cell samples are derived 
from the rectal region of the colon. 

8. A method according to any of Claims 1-7 wherein said first and 
second cell samples comprise epithelial cells. 

9. A method according to any of Claims 1-8 wherein said carbon source 
which is not butyrate is glucose. 

10. A method according to any of Claims 1-9 wherein said nucleic acid 
molecule which shows altered expression is selected from the group as 
represented by the nucleic acid sequences as shown in Table 1, or nucleic 
acid molecules which hybridise to the sequences presented in Table 1. 

11. A method for the detection of at least one nucleic acid molecule 
associated with the initiation and/or progression of colorectal cancer, in an 
animal, comprising the steps of: 

i) providing a biological sample comprising at least one cell to 
be tested; 

ii) contacting said sample with a ligand which binds at least one 
nucleic acid molecule as represented by the nucleic acid 
sequence selected from the group consisting of: 

a) a nucleic acid molecule as represented by the nucleic acid 
sequence as shown in Table 1; 

b) a nucleic acid molecule which hybridises to nucleic acid 
molecules as defined in (a); 

c) a nucleic acid molecule that is degenerate because of the genetic 
code to the nucleic acid molecule represented in (a) and (b); and 

iii) detecting the presence of at least one nucleic acid molecule in 
said sample. 

12. A method according to Claim 11 wherein said colorectal cancer is 
adenocarcinoma. 


13. A method according to Claim 11 or 12 wherein said ligand is a nucleic acid 
molecule adapted to anneal to said nucleic acid molecule which is associated with 
colorectal cancer. 

14. A method according to Claim 13 wherein said method is a polymerase chain 
reaction method. 

15. A method for the detection of at least one polypeptide associated with the 
initiation and/or progression of colorectal cancer, in an animal, comprising the steps 
of: 

i) providing a biological sample comprising at least one cell to be tested; 

ii) contacting said sample with at least one ligand which ligand 
specifically binds at least one polypeptide encoded by a nucleic acid 
molecule as represented by the nucleic acid sequence as shown in 
Table 1, or a variant polypeptide comprising an amino acid sequence 
which varies by the addition, deletion or substitution of at least one 
amino acid residue of the amino acid sequence shown in Table 1; and 

iii) detecting the presence of at least one polypeptide in said sample. 

16. A method according to any of Claims 11-15 wherein said animal is human. 

17. A method according to Claim 15 or 16 wherein said ligand is an antibody. 



18. A method according to Claim 17 wherein said antibody is a monoclonal 
antibody, or at least the effective binding part thereof. 


19. The use of at least one polypeptide, or variant sequence thereof, encoded by a 
nucleic acid molecule(s) as represented by the nucleic acid sequence as shown in 
Table 1, as a target for the screening of agents which modulate the activity of said 
polypeptide. 


20. A method to screen for agents which modulate the activity of at least one 
polypeptide encoded by a gene associated with the initiation and/or progression of 
colorectal cancer comprising the steps of: 

i) forming a preparation comprising at least one polypeptide wherein 
said polypeptide is encoded by a nucleic acid sequence as shown in 
Table 1, or a variant polypeptide comprising an amino acid sequence 
which varies by the addition, deletion or substitution of at least one 
amino acid residue of the amino acid sequence shown in Table 1 and 
at least one agent to be tested; and 

ii) determining the activity of said agent with respect to activity of said 
polypeptide. 

21. A method according to Claim 20 wherein said polypeptide is expressed by a 
cell wherein said cell is transformed or transfected with said nucleic acid molecule. 


22. A method according to Claim 21 wherein said nucleic acid molecule is part of 
a vector adapted for recombinant expression of said nucleic acid molecule. 

23. A method according to Claim 22 wherein said vector is provided with a 
promoter which enables the expression of said nucleic acid molecule to be regulated. 






24. A method according to any of Claims 21-23 wherein said cell is derived from 
the colon. 

25. A method according to Claim 24 wherein said cell is an epithelial cell. 


26. A method according to any of Claims 20-25 wherein said agent is an 
antibody. 

27. A method according to Claim 26 wherein said antibody is a monoclonal 
antibody or modified monoclonal antibody, or at least the effective binding part 
thereof. 

28. A method according to Claim 27 wherein said binding part is a Fab fragment. 

29. A method according to Claim 28 wherein said antibody is selected from the 
group consisting of: F(ab')2, Fab, Fv and Fd fragments; antibodies comprising CDR3 
regions, and single chain antibody variable regions. 

30. A method according to Claim 26 wherein said antibody is humanised. 

31. A method according to Claim 26 wherein said antibody is a chimeric 
antibody. 

32. A method according to any of Claims 20-25 wherein said agent is a 
polypeptide. 

33. A method according to any of Claims 20-25 wherein said agent is a peptide. 

34. A method according to any of Claims 20-25 wherein said agent is a nucleic acid 
molecule. 

35. A method according to Claim 34 wherein said nucleic acid molecule is an 
aptamer. 

36. A method according to Claim 34 wherein said nucleic acid is an inhibitory 
RNA molecule. 

37. A method according to Claim 36 wherein said inhibitory RNA is encoded by 
a transcription cassette comprising a nucleic acid molecule, or part thereof, selected 
from the group consisting of: 

i) a nucleic acid molecule as represented by the nucleic acid sequence 
as shown in Table 1; 

ii) a nucleic acid molecule which hybridises to the sequence in (i); or 

iii) a nucleic acid molecule which is degenerate because of the genetic 
code to the sequences defined in (i) and (ii) above; wherein said 
cassette is adapted such that both sense and antisense nucleic acid 
molecules are transcribed from said cassette. 



38. A method according to Claim 37 wherein said cassette is provided with at 
least two promoters adapted to transcribe both sense and antisense strands of said 
nucleic acid molecule. 

39. A method according to Claim 37 wherein said cassette comprises a nucleic 
acid molecule wherein said molecule comprises a first part linked to a second part 
wherein said first and second parts are complementary over at least part of their 
sequence and further wherein transcription of said nucleic acid molecule produces an 
RNA molecule which forms a double stranded region by complementary base pairing 
of said first and second parts. 



40. A method according to Claim 34 wherein said nucleic acid molecule is an 
antisense nucleic acid molecule. 

41. An antibody, or effective binding part thereof, identified by the method 
according to any of Claims 26-31 for use as a pharmaceutical. 

42. A polypeptide identified by the method according to Claim 32 for use as a 
pharmaceutical. 

43. A peptide identified by the method according to Claim 33 for use as a 
pharmaceutical. 

44. A nucleic acid molecule identified by the method according to Claim 34 for use 
as a pharmaceutical. 

45. Use according to Claim 44 wherein said nucleic acid molecule is an aptamer. 

46. Use according to Claim 44 wherein said nucleic acid molecule is an inhibitory 
RNA. 

47. Use according to Claim 44 wherein said nucleic acid molecule is an antisense 
nucleic acid molecule. 

48. Use according to any of Claims 41-47 wherein said pharmaceutical further 
comprises a diluent, carrier or excipient. 

Abstract 



We describe a method for the identification of genes which show regulated 
expression in response to carbon source utilisation, typically genes associated with 
the initiation and/or promotion of cell transformation from a non-cancerous to a 
cancerous phenotype, typically of cells found in the colon; the use of these genes in 
diagnostic assays and as targets for the development of chemotherapeutic drugs and 
agents identified by said assay. 

TABLE 1 



AC   J02966;
DE   Human mitochondrial ADP/ADT translocator mRNA, complete cds.
XX
KW   ADP/ADT translocator.
FT   /translation="MGDHAWSFLKDFLAGAVAAAVSKTAVAPIERVKLLLQVQHASKQI
FT   SAEKQYKGIIDCVVRIPKEQGFLSFWRGNLANVIRYFPTQALNFAFKDKYKQLFLGGVD
FT   RHKQFWRYFAGNLASGGAAGATSLCFVYPLDFARTRLAADVGRRAQREFHGLGDCIIKI
FT   FKSDGLRGLYQGFNVSVQGIIIYRAAYFGVYDTAKGMLPDPKNVHIFVSWMIAQSVTAV
FT   AGLLSYPFDTVRRRMMMQSGRKGADIMYTGTVDCWRKIAKDEGAKAFFKGAWSNVLRGM
FT   GGAFVLVLYDEIKKYV"
XX
SQ   Sequence 1320 BP; 341 A; 304 C; 357 G; 318 T; 0 other;
ccccctagcg tcgcgcaggg tcggggactg cgcgcggtgc caggccgggc gtgggcgaga 60
gcacgaacgg gctgctgcgg gctgagagcg tcgagctgtc accatgggtg atcacgcttg 120
gagcttccta aaggacttcc tggccggggc ggtcgccgct gccgtctcca agaccgcggt 180
cgcccccatc gagagggtca aactgctgct gcaggtccag catgccagca aacagatcag 240
tgctgagaag cagtacaaag ggatcattga ttgtgtggtg agaatcccta aggagcaggg 300
cttcctctcc ttctggaggg gtaacctggc caacgtgatc cgttacttcc ccacccaagc 360
tctcaacttc gccttcaagg acaagtacaa gcagctcttc ttagggggtg tggatcggca 420
taagcagttc tggcgctact ttgctggtaa cctggcgtcc ggtggggccg ctggggccac 480
ctccctttgc tttgtctacc cgctggactt tgctaggacc aggttggctg ctgatgtggg 540
caggcgcgcc cagcgtgagt tccatggtct gggcgactgt atcatcaaga tcttcaagtc 600
tgatggcctg agggggctct accagggttt caacgtctct gtccaaggca tcattatcta 660
tagagctgcc tacttcggag tctatgatac tgccaagggg atgctgcctg accccaagaa 720
cgtgcacatt tttgtgagct ggatgattgc ccagagtgtg acggcagtcg cagggctgct 780
gtcctacccc tttgacactg ttcgtcgtag aatgatgatg cagtccggcc ggaaaggggc 840
cgatattatg tacacgggga cagttgactg ctggaggaag attgcaaaag acgaaggagc 900
caaggccttc ttcaaaggtg cctggtccaa tgtgctgaga ggcatgggcg gtgcttttgt 960
attggtgttg tatgatgaga tcaaaaaata tgtctaatgt aattaaaaca caagttcaca 1020
gatttacatg aacttgatct acaagttcac agatccattg tgtggtttaa tagactattc 1080
ctaggggaag taaaaagatc tgggataaaa ccagactgaa aggaatacct cagaagagat 1140
gcttcattga gtgttcatta aaccacacat gtattttgta tttattttac atttaaattc 1200
ccacagcaaa tagaaataat ttatcatact tgtacaatta actgaagaat tgataataac 1260
                                            aaaaaaaaaa aaaaaaaaaa 1320
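
As a simple consistency check on the entries in Table 1, the base counts given on each sequence summary line should add up to the stated sequence length. A minimal sketch follows, assuming summary lines of the form "Sequence 1320 BP; 341 A; 304 C; 357 G; 318 T; 0 other;" as they appear in the listing; the function names are hypothetical.

```python
import re


def parse_summary(line):
    """Extract the stated length and per-base counts from a summary line."""
    length_match = re.search(r"Sequence\s+(\d+)\s+BP", line)
    if length_match is None:
        raise ValueError("no sequence length found")
    length = int(length_match.group(1))
    counts = {
        base: int(number)
        for number, base in re.findall(r"(\d+)\s+(A|C|G|T|other);", line)
    }
    return length, counts


def summary_is_consistent(line):
    """True when the base counts sum to the stated length."""
    length, counts = parse_summary(line)
    return sum(counts.values()) == length


adp_line = "Sequence 1320 BP; 341 A; 304 C; 357 G; 318 T; 0 other;"
vnn1_line = "Sequence 3109 BP; 973 A; 630 C; 601 G; 905 T; 0 other;"
print(summary_is_consistent(adp_line), summary_is_consistent(vnn1_line))
```

The same counts could be compared against the nucleotide lines themselves once a complete, ungapped copy of each sequence is available.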



ID   HSA132099 standard; mRNA; HUM; 3109 BP.
DE   Homo sapiens mRNA for VNN1 protein
KW   vanin-like gene; vnn1 gene; VNN1 protein.
FT   /protein_id="CAA10568.1"
FT   /translation="MTTQLPAYVAILLFYVSRASCQDTFIAAVYEHAAILPNATLTPVS
FT   REEALALMNRNLDILEGAITSAADQGAHIIVTPEDAIYGWNFNRDSLYPYLEDIPDPEV
FT   NWIPCNNRNRFGQTPVQERLSCLAKNNSIYVVANIGDKKPCDTSDPQCPPDGRYQYNTD
FT   VVFDSQGKLVARYHKQNLFMGENQFNVPKEPEIVTFNTTFGSFGIFTCFDILFHDPAVT
FT   LVKDFHVDTIVFPTAWMNVLPHLSAVEFHSAWAMGMRVNFLASNIHYPSKKMTGSGIYA
FT   PNSSRAFHYDMKTEEGKLLLSQLDSHPSHSAVVNWTSYASSIEALSSGNKEFKGTVFFD
FT   EFTFVKLTGVAGNYTVCQKDLCCHLSYKMSENIPNEVYALGAFDGLHTVEGRYYLQICT
FT   LLKCKTTNLNTCGDSAETASTRFEMFSLSGTFGTQYVFPEVLLSENQLAPGEFQVSTDG
FT   RLFSLKPTSGPVLTVTLFGRLYEKDWASNASSGLTAQARIIMLIVIAPIVCSLSW"
SQ   Sequence 3109 BP; 973 A; 630 C; 601 G; 905 T; 0 other;

cattggactt cagcatgact actcagttgc cagcttacgt ggcaattttg cttttctatg 60 

tctcaagagc cagctgccag gacactttca ttgcagctgt ttatgagcat gcagcgatat 120 

tgcccaatgc caccctaaca ccagtgtctc gtgaggaggc tttggcatta atgaatcgga 180 

atctggacat tttggaagga gcgatcacat cagcagcaga tcagggtgcg catattattg 240 

tgactccaga agatgctatt tatggctgga acttcaacag ggactctctc tacccatatt 300 

tggaggacat cccagaccct gaagtaaact ggatcccctg taataatcgt aacagatttg 360 

gccagacccc agtacaagaa agactcagct gcctggccaa gaacaactct atctatgttg 420 

tggcaaatat tggggacaag aagccatgcg ataccagtga tcctcagtgt ccccctgatg 480 

gccgttacca atacaacact gatgtggtat ttgattctca aggaaaactg gtggcacgct 540 

accataagca aaaccttttc atgggtgaaa atcaattcaa tgtacccaag gagcctgaga 600 

ttgtgacttt caataccacc tttggaagtt ttggcatttt cacatgcttt gatatactct 660 

tccatgatcc tgctgttacc ttggtgaaag atttccacgt ggacaccata gtattcccaa 720 

cagcttggat gaatgttttg ccacatttgt cagctgttga attccactca gcttgggcta 780 

tgggcatgag ggtcaatttc cttgcatcca acatacatta cccctcaaag aaaatgacag 840 

gaagtggcat ctatgcaccc aattcttcaa gagcatttca ttatgatatg aagacagaag 900 

agggaaaact cctcctctcg caactggatt cccacccatc ccattctgca gtggtgaact 960 

ggacttccta tgccagcagt atagaagcgc tctcatcagg aaacaaggaa tttaaaggca 1020 

ctgtcttttt cgatgaattc acttttgtga agctcacagg agttgcagga aattatacag 1080 

tttgtcagaa agatctctgc tgtcatttaa gctacaaaat gtctgagaac ataccaaatg 1140 

aagtgtacgc tctaggggca tttgacggac tgcacactgt ggaagggcgc tattatctac 1200 

agatttgtac cctgttgaaa tgtaaaacga ctaatttaaa cacttgcggt gactcagctg 1260 

aaacagcttc taccaggttt gaaatgttct ccctcagtgg cactttcgga acccagtatg 1320 

tctttcctga ggtgttgctg agtgaaaatc agcttgcacc tggagaattt caggtgtcaa 1380

ctgacggacg cttgtttagt ctgaagccaa catccggacc tgtcttaaca gtaactctgt 1440 

ttgggaggtt gtatgagaag gactgggcat caaatgcttc atcaggcctc acagcacaag 1500 

caagaataat aatgctaata gttatagcac ctattgtatg ctcattaagt tggtagaata 1560 

ttgacttttt ctctttttta tttgggataa tttaaaaaat gatggatgag aaaagaaaga 1620 

ttggtccggg ttaatattat cctctagtat aagtgaatta ctagtttctc tttatttaga 1680 

caaacacaca cacaccagat aatataaact taataaatta tctgttaatg tagattttat 1740 

ttaaaaaact atatttgaac attggtcttt cttggacgtg agctaattat atcaaataag 1800 

tatcacaaat cttttacgca gaagaaataa aaactacggg tagaaaacat aagaactatc 1860 

ataaaattta cttacaagga ggctgctctt gttaccactt ttattatatt acgtatcact 1920 

tattcagctc tgctgaaaat ttccaatgac tttgtttgtt tgctctttta gttttttacc 1980 

taaacaatac attttgattc tcttgtgggt tgataatgtc tccccaaaat ttacatgttg 2040 

aagcacctca gaatgtgact gtatttggag acagggtctt taaagaggta aaataaggtc 2100 

attaggatag accctaattc aatatgactg atgatcataa aagaagaggc gagtagggca 2160 

caacaggcac aaagggagac cataaggaga cacagaggaa ggacaactct ttacaagcta 2220 

agaagagagg gcctcagaag aaaccaaccc tgccaacacc ttgatcttgg acttccagcc 2280 

tccaaaacta tgagaaataa atttctattg tttaagtcac ccagtccatg gtactttgtt 2340 

aggcagccct ggcaaatgaa tcaaagaccc attcctgttc ctctccccac cactactgtt 2400 

ttctactgta atctgaagct tcaacaaaag gcttacctgg taagaatatt cagctggtct 2460 



7 



gggtcctcaa gactccaata gacactctta aagaaggatt gctgatggat tgatagtgaa 2520

accattagat cattgaattc ctctggaatt agaaaaccag agagtcccat tttaagaaat 2580

tagatattta atatagcatt gtgtgttcta ttttagtaac agcagaatct cttgacatta 2640

cacaactcag tgaaacaaca tcatttaagc caaaatatct cccaactgac tgatagactc 2700

tgagcactaa tatcatagtg ctgtgatgat ggacaattac atagtaccga taacagccat 2760

gcactgtgca aagcatgccc ttctgcacag gagagcaagg cacttgcagt agtgatctat 2820

gccagcaaaa catcattttg agacaaacat ttttgtggca gatgtttttc ctaaaaagta 2880

ctatatcatc caagaaatat ttgagtaaaa tcccttgttc ttttgggtga cattaactga 2940

catttgcttt ttttcaagac ctaatagaaa ataagaaagc ccataatgta tttagaaaca 3000

ggaatcctca gagcaattct ctgtattctc atataatttc aatgtaaaac agaaaacata 3060

ttgatgtgtt ggtgataggc ttgaattatt aaaaacttca aaaacaaaa 3109




Homo sapiens transmembrane protein 5, mRNA (cDNA clone MGC:17085
IMAGE:3919181), complete cds.

/protein_id="AAH13152.1"

/translation="MRLTRKRLCSFLIALYCLFSLYAAYHVFFGRRRQAPAGSPRGLRK
GAAPARERRGREQSTLESEEWNPWEGDEKNEQQHRFKTSLQILDKSTKGKTDLSVQIWG
KAAIGLYLWEHIFEGLLDPSDVTAQWREGKSIVGRTQYSFITGPAVIPGYFSVDVNNVV
LILNGREKAKIFYATQWLLYAQNLVQIQKLQHLAVVLLGNEHCDNEWINPFLKRNGGFV
ELLFIIYDSPWINDVDVFQWPLGVATYRNFPVVEASWSMLHDERPYLCNFLGTIYENSS
RQALMNILKKDGNDKLCWVSAREHWQPQETNESLKNYQDALLQSDLTLCPVGVNTECYR
IYEACSYGSIPVVEDVMTAGNCGNTSVHHGAPLQLLKSMGAPFIFIKNWKELPAVLEKE
KTIILQEKIERRKMLLQWYQHFKTELKMKFTNILESSFLMNNKS"

Sequence 1469 BP; 446 A; 300 C; 349 G; 374 T; 0 other; 
ggctgggcct gcctcggacg ccgccggtgt cgcggattct ctttccgccc gctccatggc 60
ggtggatgcc tgactggaag cccgagtggg atgcggctga cgcggaagcg gctctgctcg 120
tttcttatcg ccctgtactg cctattctcc ctctacgctg cctaccacgt cttcttcggg 180
cgccgccgcc aggcgccggc cgggtccccg cggggcctca ggaagggggc ggcccccgcg 240
cgggagagac gcggccgaga acagtccact ttggaaagtg aagaatggaa tccttgggaa 300
ggagatgaaa aaaatgagca acaacacaga tttaaaacta gccttcaaat attagataaa 360
tccacgaaag gaaaaacaga tctcagtgta caaatctggg gcaaagctgc cattggcttg 420
tatctctggg agcatatttt tgaaggctta cttgatccca gcgatgtgac tgctcaatgg 480
agagaaggaa agtcaatcgt aggaagaaca cagtacagct tcatcactgg tccagctgta 540
ataccagggt acttctccgt tgatgtgaat aatgtggtac tcattttaaa tggaagagaa 600
aaagcaaaga tcttttatgc cacccagtgg ttactttatg cacaaaattt agtgcaaatt 660
caaaaactcc agcatcttgc tgttgttttg ctcggaaatg aacattgtga taatgagtgg 720
ataaacccat tcctcaaaag aaatggaggc ttcgtggagc tgcttttcat aatatatgac 780
agcccctgga ttaatgacgt ggatgttttt cagtggcctt taggagtagc aacatacagg 840
aattttcctg tggtggaggc aagttggtca atgctgcatg atgagaggcc atatttatgt 900
aatttcttag gaacgattta tgaaaattca tccagacagg cactaatgaa cattttgaaa 960
aaagatggga acgataagct ttgttgggtt tcagcaagag aacactggca gcctcaggaa 1020
acaaatgaaa gtcttaagaa ttaccaagat gccttgcttc agagtgatct cacattgtgc 1080
ccggtcggag taaacacaga atgctatcga atctatgagg cttgctccta tggctccatt 1140
cctgtggtgg aagacgtgat gacagctggc aactgtggga atacatctgt gcaccacggt 1200
gctcctctgc agttactcaa gtccatgggt gctcccttta tctttatcaa gaactggaag 1260
gaactccctg ctgttttaga aaaagagaaa actataattt tacaagaaaa aattgaaaga 1320
agaaaaatgt tacttcagtg gtatcagcac ttcaagacag agcttaaaat gaaatttact 1380
aatattttag aaagctcatt tttaatgaat aataaaagtt aattatcttt ttgagctaaa 1440
aaaaaaaaaa aaaaaaaaaa aaaaaaaaa



XX 



DE Homo sapiens CD3e-associated protein (CAST) mRNA, complete cds.
FT /protein_id="AAD41158.1"

"Q^^ECFNGRHVPLSGSQIVKGKIAGKRHRYRVLSSCPQAGEATI.LAPSTEAG 
SSJf^ GP ^ raP ^^EPLGVLPPSTTKKRKKPKGKETFEPEDKTVKQEQINTEPL 






PGSTK3CRKKQSQESRMPETVPQEEMPGPPLNSESGEEAPTGRDKKRKQQQQQPV"^ 



Sequence 1841 BP; 512 A; 502 C; 576 G; 251 T; 0 other;
aggagcccca 33ccggcggt gaggatgctg ctcggttctc ttgtcccccc 
ntllltZ? Cgaagccccc *9cctcagag tcccctcgtt tctccttgga ggcgctgacg 
KST C ? ga 9 ct 9t9 3Cttattcag gcccctgcag actttgcccc Igaftgcttc 
aa «^ 9 a !9 t 9 cctct ctctggctcc cagatcgtca agggcaaatt gXaggcaaa 
cggcaccgct atcgagtcct cagcagctgt ccccaagctg gagaagcgac cctScIoJcc 
a ? gCa " agg tggactcacc tgtgccLa^ ccccccaggg oacSctflgg 
atccttgagg gtccccagca atccctgtca gggagcoctc tgcagcccat cccaacaaa? 
cccccaccac agatccctcc tggcctgagg cctcigttct g?gcc?t3g gggclaccca 
ccagtcacag ggcctaggtc agccttggcc cccaacctgc ?cacctcaS Sa^ataaaa 
ctaaaaaS a "! gaCaga 9gccccagtc actcaggagg cagtgaatg £af g^cc 
aaffaSoc " 9gggtcgcca gaaatggatg tgcggaagla g^gllgaaa 

aaaaatcagc agctgaaaga accagaggca gcagggcctg tggggacaga gcccacaoto 
gagacactgg agcctctggg agtgctgttc ccgtccacci ccalSaSag SaglSccc 

gagfcfSS aSacacSt aagacagtga agcaggalcf gaSIa^ct 

T^Zlltl* 9 aagacaca 9t cctgtccccg accaaaaaga gaaagaggca aaaggggacq 
IctalllJ, 9 agCCa9a " a 99393tgaca gttgagtctc agccacaggt gaaggtgga? 
ItlTalatl? aa9CCatccc tctgccccct acgaagaaga ggaaaaalga LaggglcS 
atggcaatga tggagccagg gacggaggcg atggagccag tggagccgga gatgSJcc? 
ctggagtccc caggggggac catggcgcct caacagccai alggtgcXa gccfJalacc 
caggcagctc tggcagctcc caaaaagaag acgaagaaaj aaalacagca KaJgccfca 
TcllSlZl T a C o^, 9t 99t9999CCt 9 a 9ctgccgl atgacct?ga glctcaggca 
ccaattcaae tt^ 9 ? aagaa 9 aa 9 aagaaagaga gaggtcacac agtgaclgag 
™i v 9 cacta 9 a 9 cc tgaactgcca ggggagggac agcctgaagc cagggcaac? 
ccgggatcca ccaagaagag gaagaagcag agtcaggaaa gccggatgcc aglglcaSq 
9399 a9at9Cca 99 gccgccactg aattcagagt ctgggga|ga Specie! 
ggccgggaca agaagcggaa gcagcagcag cagcagcctg tg?S? c ?gc ScggSaa 
ctgaggaact aaagaaagct gaaggtgccc acctgggccl ccagLggfg acacccccaq" 
aa S^ CC ° Caga9act9C accagcgcag ccagcaggag cctggcc?^ gaggacgat? 
tcSacaaaa £ g ? g9gttt "ttggcagc tggggtcatc agglfact?? caagJagggc 
^ 9 ^ C <^ C £^T 9CC t99at " 9a9 ggagaaaaaa atglggllcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 




Homo sapiens Apo-2 ligand mRNA, complete cds. 

/translation="MAMMEVQGGPSLGQTCVLIVIFTVLLQSLCVAVTYVYFTNELKQM
QDKYSKSGIACFLKEDDSYWDPNDEESMNSPCWQVKWQLRQLVRKMILRTSEETISTVQ
EKQQNISPLVRERGPQRVAAHITGTRGRSNTLSSPNSKNEKALGRKINSWESSRSGHSF
LSNLHLRNGELVIHEKGFYYIYSQTYFRFQEEIKENTKNDKQMVQYIYKYTSYPDPILL
MKSARNSCWSKDAEYGLYSIYQGGIFELKENDRIFVSVTNEHLIDMDHEASFFGAFLVG"

3'UTR 937..1042

Sequence 1042 BP; 348 A; 208 C; 232 G; 254 T; 0 other; 
tttcctcact gactataaaa gaatagagaa ggaagggctt cagtgaccgg ctgcctggct 60
gacttacagc agtcagactc tgacaggatc atggctatga tggaggtcca ggggggaccc 120
agcctgggac agacctgcgt gctgatcgtg atcttcacag tgctcctgca gtctctctgt 180
gtggctgtaa cttacgtgta ctttaccaac gagctgaagc agatgcagga caagtactcc 240
aaaagtggca ttgcttgttt cttaaaagaa gatgacagtt attgggaccc caatgacgaa 300
gagagtatga acagcccctg ctggcaagtc aagtggcaac tccgtcagct cgttagaaag 360
atgattttga gaacctctga ggaaaccatt tctacagttc aagaaaagca acaaaatatt 420
tctcccctag tgagagaaag aggtcctcag agagtagcag ctcacataac tgggaccaga 480
ggaagaagca acacattgtc ttctccaaac tccaagaatg aaaaggctct gggccgcaaa 540
ataaactcct gggaatcatc aaggagtggg cattcattcc tgagcaactt gcacttgagg 600
aatggtgaac tggtcatcca tgaaaaaggg ttttactaca tctattccca aacatacttt 660
cgatttcagg aggaaataaa agaaaacaca aagaacgaca aacaaatggt ccaatatatt 720
tacaaataca caagttatcc tgaccctata ttgttgatga aaagtgctag aaatagttgt 780
tggtctaaag atgcagaata tggactctat tccatctatc aagggggaat atttgagctt 840
aaggaaaatg acagaatttt tgtttctgta acaaatgagc acttgataga catggaccat 900
gaagccagtt ttttcggggc ctttttagtt ggctaactga cctggaaaga aaaagcaata 960
acctcaaagt gactattcag ttttcaggat gatacactat gaagatgttt caaaaaatct 1020
accaaaaca aacaaacaga aa



DE Homo sapiens mRNA for annexin A13 (ANXA13 gene), isoform b
FT /protein_id="CAC34622.1"

FT /translation="MGNRHSQSYTLSEGSQQLPKGDSQPSTVVQPLSHPSRNGEPEAPQ

1 PAKASS PQGPDVDRDAKKLNKACKGMGTNEAAI IE IL SGRTSDERQQ IKQKYKATYGKE 

kEEVLKSELSGNFEKTAIuALLDRPSEYAARQLQKAM^ 

AI KEAYQRLPDRSIiESDVKGDTSGNLKKI LVSLLQANRNEGDDVDKDLAGQDAKDIiYDA 

GEGRWGTDEIiAFNEVLAKRSYKQLRATFQAYQILIGKDIEEAIEEETSGDLQKAYIjTLV 

RCAQDCEDYFAERLYKSMKGAGTDEETLIRIVVTRAEVDLQGIKAKFQEKYQKSLSD1W 
RSDTSGDFRKLLVALLH"

FT exon 84..206

FT /gene="ANXA13"

SQ Sequence 1588 BP; 484 A; 351 C; 410 G; 343 T; 0 other;



ttaattaaag ttgtggatta tgcctcta 



a pans aaaari 


catctcagaa 


60 


C* 1~ c* A oa a cifT t~* 
u^ciycictyyvj 


agtcaacagt 


12 0 


tctcagccac 


ccafrsrcrna 

V-> »— d LvClUMMCt 




tcctcagggt tttgatgtgg 240
ggggaccaat gaagcagcca 300
acaaatcaag caaaagtaca 360
tgagctgagt ggaaacttcg 420
cgccgcccgg cagctgcaga 480
tgaggtcctg tgcacgagga 540
gctatttgat aggagcctcg 600
aatcctggtg tctctgctgc 660
agctggtcag gatgccaaag 720
gcttgcgttc aatgaagtcc 780
agcctatcaa attctcattg 840
cgacttgcag aaggcctatt 900
tgctgaacgt ctgtacaagt 960
catagtcgtg accagggccg 1020
gtatcagaag tctctctctg 1080
gctagtagcc ctcttgcact 1140
tttgtcaaga gcacattcca 1200
agagtcccgg attactttct 1260
taagggcagt aacgttaaga 1320
ctaggctgcc tcttcacttt 1380
aatcagcagt tgatgaaaaa 1440
acacaagtaa ggtacaggaa 1500
tcaatttgaa agcctcattg 1560




DE Homo sapiens serine protease inhibitor, Kazal type 1, mRNA (cDNA clone



Sequence 362 BP; 121 A; 74 C; 75 G; 92 T; 0 other; 
cgcagaactt cagccatgaa ggtaacaggc atctttcttc tcagtgcctt ggccctgttg 60
agtctatctg gtaacactgg agctgactcc ctgggaagag aggccaaatg ttacaatgaa 120
cttaatggat gcaccaagat atatgaccct gtctgtggga ctgatggaaa tacttatccc 180
aatgaatgcg tgttatgttt tgaaaatcgg aaacgccaga cttctatcct cattcaaaaa 240
tctgggcctt gctgagaacc aaggttttga aatcccatca ggtcaccgcg aggcctgact 300
ggccttattg ttgaataaat gtatctgaat atcaaaaaaa aaaaaaaaaa aaaaaaaaaa 360



DE Homo sapiens B cell linker protein BLNK mRNA, alternatively spliced, complete cds.

FT /translation="MDKLNKITVPASQKLRQLQKWVHDIKNNEGGIMNKIKKLKVKAPP
FT SVPRRDYASESPADEEEQWSDDFDSDYENPDEHSDSEMYVMPAEENADDSYEPPPVEQE
FT TRPVHPALPFARGEYIDNRSSQRHSPPFSKTLPSKPSWPSEKARLTSTLPALTALQKPQ
FT VPPKPKGLLEDEADYVVPVEDNDENYIHPTESSSPPPEKAPMVNRSTKPNSSTPASPPG
FT TASGRNSGAWETKSPPPAAPSPLPRAGKKPTTPLKTTPVASQQNASSVCEEKPIPAERH
FT RGSSHRQEAVQSPVFPPAQKQIHQKPIPLPRFTEGGNPTVDGPLPSPSSNSTISEQEAG
FT VLCKPWYAGACDRKSAEEALHRSNKDGSFLIRKSSGHDSKQPYTLVVTFNKRVYNIPVR
FT FIEATKQYALGRKKNGEEYFGSVAEIIRNHQHSPLVLIDSQNNTKDSTRLKYAVKVS"
SQ Sequence 1806 BP; 571 A; 448 C; 379 G; 408 T; 0 other;

rrt-frrtfrrrrn r*r*r*,~L~~ — j . ' 



aaaaaa 



acccttaaaa 


D U 


accactaoac 




aataaccatc 




aaacaataaa 




*»g L- y d 




tcractfctcrac 


O O V 




A O ft 


gccggttcac 480
gaggcattcc 540
agcaaggctc 600
caaacccaaa 660
tgaaaactat 720
gaatagatca 780
tcgaaacagt 840
acgggccggg 900
tgcttcaagt 960
cagacaagaa 1020
acccatacct 1080
cagcttttca 1140
gtatgctgga 1200
tggatcattt 1260
tgtattcttt 1320
atatgccttg 1380
caggaatcat 1440
caccagactg 1500
tgcttcagac 1560
tggattatga 1620
catttctttt 1680
tatctggtga 1740
aaaaaaaaaa 1800



Homo sapiens cDNA FLJ12768 fis, clone NT2RP2001576, weakly similar to
HYPOTHETICAL 62.2 KD PROTEIN C4G8.12C IN CHROMOSOME I.

/protein_id="BAB14263.1"

/translation="MQICGSSVASVAAGTSFQVLGPVCWQQLDLKMAVRVLWGGLSLLR
VLWCLiIjPQTGYVHPDEFFQSPEVMAEDILGVQAARPWEFYPS S S CRSVLFPLLI SGSTF 
WLLRIjWEELGPWPGLVSGYALLVGPRLLLTAL^ 

GS YVTIjVFYTRTFSNTIEGLLFTWIJjVLVS SHVTWGPTRKEPAPGPRWRS WIjLGG I VAA 

GFFNRPTFLAFAVVPLYLWGTRGATNPGLKSLTREALVL^ 

SPATSRmjVIiTPWFLHYNIiNPQNLARHGTHARIiTHLAW 

IjQVGLQAS AQMGIiLRALGARSIiLS S PRS YLLLL YFMPIiALIiS AFSHQEARFL I PLLVPL 
YliLCSPQTQPVPWKGTVVLFNALGALLFGCLHQGG 

TLLFTHTYMPPRHLLHLPGLGAPVEVVDMGGTEDWALCQTLKSFTRQPACQVAGGPWLC 
RLFVVTPGTTRRAVEKCSFPFKNETLLFPHLTLEDPPALSSLLSGAWRDHLSLHIVELG
EET"

Sequence 2687 BP; 454 A; 883 C; 733 G; 617 T; 0 other; 

agtctccgcg ctgctgaggc gcgcccggcc gctcccacgg cctcccctcc gccctgcggt 60 

cccgccgcct ccggggcctc ctgggaccct ggccctcgcc gggcaggacg ccgccagcgc 120 

tgaaggcgca gcccggaggc cgcgcggatg cagatctgtg gatccagcgt agcatctgta 180 

gcagctggga catcattcca ggttttgggc ccggtgtgtt ggcaacaact ggatctgaag 240 

atggcagtca gggtgctttg gggtggtctc agcctgctcc gagtgctgtg gtgtctcctt 300

ccgcagacgg gctatgtgca cccagatgag ttcttccagt cccctgaggt gatggcagag 360

gacatcctgg gcgttcaggc cgcgcggccc tgggagtttt accccagcag ctcctgccgc 420

tcggtgctct tccccctgct gatctctggt tccaccttct ggctgctcag gctctgggag 480

gagctggggc cgtggcctgg cctggtgagc ggctatgcgc tgctggtggg gcctcgactc 540

ctcctcactg ccctttcctt tgctctggac ggggccgtgt accacctggc cccgccgatg 600

ggggcggatc gctggaacgc cctggccctg ctgtctggtt cctacgtcac cctggtcttc 660

tacacaagga ccttctccaa caccattgag ggactcctct tcacgtggct gctggtgctg 720

gtatcctccc atgtaacgtg gggccctaca cgcaaggagc cggcgccggg tccacggtgg 780

cgcagctggc ttcttggagg cattgtggct gctggcttct tcaaccggcc cacctttctg 840

gcctttgctg tggtccccct ctacctctgg ggcactcgtg gagccacaaa ccctggtttg 900

aagtctctga cccgggaggc cctggtgctg ctccctgggg cgaccctcac agcagcggtg 960

tttgtggcca cggacagctg gtatttctcc agccccgcga catccaggaa ccttgtcctg 1020

acacctgtca acttcctgca ctacaacctg aatccccaaa acctggcgag acatggcacg 1080

cacgcgcggc tcactcacct ggcagtcaac ggcttcctgc tcttcggggt gctgcatgcc 1140

caggccctgc aggctgcgtg gcaacagctg caagtcggcc tccaggcctc tgcacaaatg 1200

ggcctcctga gggcactggg tgcccggagc ctgctgtcca gccccaggtc ctatctcctt 1260

ctcctctact tcatgcctct ggccctgcta tctgccttta gccaccagga ggctcggttc 1320

ctgattcccc tcctggtccc cctggtcctg ctttgtagtc cacagacgca gcctgtgcct 1380

tggaagggca ctgtggtcct cttcaacgcc ctcggtgccc tcctcttcgg ctgcctgcat 1440

caggggggcc tggtgcctgg cctggagtac ctggagcagg tggtccatgc ccctgtgctc 1500

ccaagcacac ccacccacta cacactcctc ttcactcaca cctacatgcc cccccggcac 1560

ctcctacacc tcccaggcct gggggcacca gtggaggtgg tggacatggg ggggactgag 1620

gactgggccc tgtgccaaac cctgaaaagc ttcaccagac aaccagcctg ccaagtggct 1680

ggtgggccat ggctctgccg cctctttgtg gtaacccctg gcaccaccag gcgtgccgtg 1740

gagaagtgca gcttcccctt caagaatgaa acacttttat ttccccatct gaccctggag 1800

gatccaccag ccctgtcctc cttgctgagt ggggcttgga gggaccacct cagtcttcac 1860

attgtggagc tgggggaaga aacctgacaa tatgacagag cacccactgc ccaagactca 1920

gccatagaag atgccgcccc accttctact tgggtagctg ggctgggacg ctgggacagg 1980

accccgtcct ccttcatgac tcccactgct gcctctcctg ggcatggctg ttagctgttc 2040

tgccttctgg gtgagctggc actcttctcc ctgagaccaa agatttgacc tgtcggtctt 2100

gatgtcaagg tccccaaaga ccaggttaag tgacgacacc tgtctgttcc tgccctgttg 2160

cttccagcca ctgtgatgtt tgaaatatgt gatagtacct ggttgtgaaa aaagacaatg 2220

aactgctagt gacattcctc aatgacctct cccaaacctc ccatgatgcc ttacccttgc 2280

tgtcatgaca accctctggc ttcctaagac ccatctgcct atcgaaatat gtgcaagtca 2340

gtgagacgaa gtatagagaa caggtggccc agatccaggg gacccaactt ctggcccctt 2400

ggtctgtcac ctcctcgctg tgtgatcttg agaaagctcc ttccactcac ccaccccact 2460

tcccagtctg ttgggatcag aggaactttg aggtgtctgc cggctaacat tgtgtcattc 2520

ctggagtcca cagtacacgt cccctgcctc aacaggcaca gctctcacaa agctcttcaa 2580

gcatggaagt gggagttgtg ttgtacttca tggcactctg atgcctgctg tctcagtgtt 2640

tggttattat gcaaacaagt aatgtttgaa atatataata gcactgg



Homo sapiens glycine amidinotransferase (L-arginine:glycine
amidinotransferase), mRNA (cDNA clone MGC:1744 IMAGE:3010128), complete

/protein_id="AAH04141.1"

/translation="MLRVRCLRGGSRGAEAVHYIGSRLGRTLTGWVQRTFQSTQAATAS
SRNSCAADDKATEPLPKDCPVSSYNEWDPLEEVIVGRAENACVPPFTIEVKANTYEKYW
PFYQKQGGHYFPKDHLKKAVAEIEEMCNILKTEGVTVRRPDPIDWSLKYKTPDFESTGL
YSAMPRDILIVVGNEIIEAPMAWRSRFFEYRAYRSIIKDYFHRGAKWTTAPKPTMADEL
YNQDYPIHSVEDRHKLAAQGKFVTTEFEPCFDAADFIRAGRDIFAQRSQVTNYLGIEWM
RRHLAPDYRVHIISFKDPNPMHIDATFNIIGPGIVLSNPDRPCHQIDLFKKAGWTIITP
PTPIIPDDHPLWMSSKWLSMNVLMLDEKRVMVDANEVPIQKMFEKLGITTIKVNIRNAN
SLGGGFHCWTCDVRRRGTLQSYLD"

Sequence 2342 BP; 690 A; 490 C; 480 G; 682 T; 0 other; 
cgggaaggct tggaccgacg cggcccagag gccaggaaca ttccgcgcgt ggaccagccg 60
ggccagggcg atgctgcggg tgcggtgtct gcgcggcggg agccgcggcg ccgaggcggt 120
gcactacatc ggatctcggc ttggacgaac cttgacagga tgggtgcagc gaactttcca 180
gagcacccag gcagctacgg cttcctcccg gaactcctgt gcagctgacg acaaagccac 240
tgagcctctg cccaaggact gccctgtctc ttcttacaac gaatgggacc ccttagagga 300
agtgatagtg ggcagagcag aaaacgcctg tgttccaccg ttcaccatcg aggtgaaggc 360
caacacatat gaaaagtact ggccatttta ccagaagcaa ggagggcatt attttcccaa 420
agatcatttg aaaaaggctg ttgctgaaat tgaagaaatg tgcaatattt taaaaacgga 480
aggagtgaca gtaaggaggc ctgaccccat tgactggtca ttgaagtata aaactcctga 540
ttttgagtct acgggtttat acagtgcaat gcctcgagac atcctgatag ttgtgggcaa 600
tgagattatc gaggctccca tggcatggcg ttcacgcttc tttgagtacc gagcgtacag 660
gtcaattatc aaagactact tccaccgtgg cgccaagtgg acaacagctc ctaagcccac 720
aatggctgat gagctttata accaggatta tcccatccac tctgtagaag acagacacaa 780
attggctgct cagggaaaat ttgtgacaac tgagtttgag ccatgctttg atgctgctga 840
cttcattcga gctggaagag atatttttgc acagagaagc caggttacaa actacctagg 900
cattgaatgg atgcgtaggc atcttgctcc agactacaga gtgcatatca tctcctttaa 960
agatcccaat cccatgcata ttgatgctac cttcaacatc attggacctg gtattgtgct 1020
ttccaaccct gaccgaccat gtcaccagat tgatcttttc aagaaagcag gatggactat 1080
cattactcct ccaacaccaa tcatcccaga cgatcatcca ctctggatgt catccaaatg 1140
gctttccatg aatgtcttaa tgctagatga aaaacgtgtt atggtggatg ccaatgaagt 1200
tccaattcaa aagatgtttg aaaagctggg tatcactacc attaaagtta acattcgtaa 1260
tgccaattcc ctgggaggag gcttccattg ctggacctgc gatgtccggc gccgaggcac 1320
cttacagtcc tacttggact gaacaggcct gatggagctt gtggctggcc tcagatacac 1380
ctaagaagct taggggcaag gttcattctc ctgctttaaa aagtgcatga actgtagtgc 1440
tttaaacaat catctcctta acaggggtcg taagcctggt ttgcttctat tacttttctt 1500
tgacataaag aaaataactt ctgctaggta ttactctcta ctcctaaagt tatttactat 1560
ttggcttcaa gtataaaatt ttggtgaatg tgtaccaaga aaaaattagt cacctgagta 1620
acttggccac taataattaa ccatctacct ctgtttttaa ttttctttcc aaaaggcagc 1680
ttgaaatgtt ggtcctaatc ttaatttttt ttcctcttct atagacttga gaatgttttt 1740
ctctaaatga gagaaagact tagaatgtac acagatccaa aatagaatca gattatctct 1800
ttttttctaa aggagagaaa gacttagaac atacacagat cctaagtaga accaggtaat 1860
tgtctctttt tctaataagg aatttgggta atttttaatt ttttgttttt taaaaaataa 1920
cctagactat gcaaaacatc aaagtgaatt ttccatgaat gtttttaata ttctcatctc 1980
aacattgtga tatatgctac taaaaacctt ttcatataca tcttacctca tttcaagtga 2040
attattttaa tctttttctc tctttccaaa aatttaggaa tgtttagtgt aattggattt 2100
cgctatcagt tcccatcctt aagttttgat attcaatatc tgatagatac actgcatctt 2160
tggtcatcta agatttgttt acaaatgtgc aaattattta gagcatagac tttataagca 2220
ttaaaaaaaa ctaatggagg taaaacctaa atgcgatgtg aaataatttt agtgttgata 2280
ccgtatgtgt atttttattc taataaactt ttgtgttcca gaaaaaaaaa aaaaaaaaaa 2340
aa



FT /translation="MGTQEGWCLLLCIALSGAAETKPHPAEGQWRAVDVVLDCFLAKDG

T ^GA^SEDRARASLVLKQVPVLDDGSLEDFTDFQGGTLAQDDPPIIFEASVDLVQI 

T PQAEAIiLHADCSGKEVTCE ^^^ F ^MTETTVKTAAWFMANVQVSGRGPS ISLVMKTPR 

4 Y^^^ T ^ PLSP QGTVRTAVEFQVm'QTQSLSFLLGSSASI^CGFSMAPGLDL 

ISV^RLQHKGRGQLVYSWTAGQGQAVRKGATLEPAQLGMARDASLTLPGLTIQDEGTY 

T ^^ AQVSGASFSSLR Q SV AGTYSISSSLTAEPGSAGATYTCQVTHISLEEPLGASTQV 

T ^^^ GVIF ^ SLFL ^ MFLGL Q^QAPTGLGLLQAERWETTSCADTQSSHLHE 



SISSS! aca 393aaga aacctaaagg ctgcaggctg ccaggtgtgc ttggagagcc 60 

cccttcttcc gccgggcctc gcaagcagcg taggactgtg gagaagggcg gtoggLagg 120 

SSZES a939Ca9CCt c «^99cac acaggaglgc ?ggtg«?gc JgSctgc?? ^ 

ggctctatct ggagcagcag aaaccaagcc ccacccagca gaggggcagt ggcgggcaat 240 

222E 0 CtagaCt9Ct tcctggcgaa ggacggtgcg caSgggag SSSJSS 30S 

o a I^ a f 9aC a9g f" a 999 cctcccttgt gctgaagcag gtgccagfg? tggacgltgg l 60 

f^"!? 939 9 acttca «9 atttccaagg gggcacactg gcccaagalg alccacctl? 420 

tatctttgag gcctcagtgg acctggtcca gattccccag gccgaggcct tgctccatgc tlo 

SccSS ! 9gaa99agg tg«cctgtg« gatctcccgc ?ac?t?ftcc agltgacagl m! 

gaccactgtt aagacagcag cttggttcat ggccaacgtg caggtctctg gacqqqqacc III 
gaS^:: tSSKT agaCtCCCag ^cgccLg aa?gaggcg? 

gacgctgaac ttgccactga gcccccaggg gactgtgcga actgcagtgg agttccaggt 720 

? a tc a c« a I aCCCaatCCC tgagcttcct gctggggtcc tcOccScS tggactggg l S0 

cttctccatg gcaccgggct tggacctcat cagtgtggag tggcgactgc agcacaam 840 

ttg ^ taca gctggaccgc agggcagggl cajgctgtgc ££££ lol 

SSS 9 CCtgcacaac tgggcatggc cagggatgcc tcHtcLSc Sgcccggcct 9sS 

o-™^? gaCga "99 a cctacatttg ccagatcacc acctctctgt aLgagctca 102S 

gcagatcatc cagctcaaca tccaagcttc ccctaaagta cgactgagct tggcaaacal loeo 
SSST" CCCaCCCtCa tctgcgacat tgctggcLt taccc?c?gg 

gacgtggacc cgagaggagc tgggtggatc cccagcccaa gtctctgglg cctccttcS 1200 

S££2? Caaagcgtgg <Wccta cagcatctcc tcctctSc! ccgcagaacc lleo 

9 9gtgccactt a cacctgcca ggtcacacac atctctctgg aggaglccct 1320 

cagSatSc SSS? tc - a -aga gcggagaaca gccttgggag tStctttgc lllo 

agSc?taao llltll 9 Cact 9 at 9 tt ^ctggggctt cagagacggc aagcacctac 1440 

a ? 9 ccatc?c cfc 9 aac 9 ct 9 39agaccact tcctgtgctg acacacagag 1500 

atgaaactS t»l»l 9 gcacagcgcg tgtaagccag cccagctgac ctaaagcgac 1560 

atgagactac tagaaagaaa cgacaccctt ccccaagccc ccacagctac tccaacccaa 162 0 

^SSST" a9CCa9ttta at 9 3 taggaa tttgtatttt ttgcc?ttgt tcagaataS JSJ 




DE Homo sapiens leucine aminopeptidase 3, mRNA (cDNA clone IMAGE:2821948),
partial cds.

/translation="LAVRRFGSRSLSTADMTKGLVLGIYSKEKEDDVPQFTSAGENFDK
LLAGKI1RETI1NISGPPLKAGKTRTFYGLHQ 

IRAAVAAGCRQI QDLELS S VEVDPCGDAQAAAEGAVIjGIjYE YDDLKQKKKMAVSAKLYG 
SGDQEAWQKGVLFASGQNIARQIiMETPANEMTPTRFAEI IEKNLKSAS SKTEVHIRPKS 
WIEEQAMGSFLSVAKGSDEPPVFIiEIHYKGSPNANEPPIjVFVGKGITFDSGGISIKASA 
NMDLMRADMGGAATI CS AI VS AAKLNLPINI IGLAPLCENMPSGKANKPGDVVRAKNGK 
TIQVDNTDAEGRLILADALCYAHTFNPKV^ 

NKLFEAS IETGDRWRMPLFEHYTRQVVDCQLADVNNIGKYRSAGACTAA 
PKWAHLDI AGVMTXsTKDEVPYLRKGMTGRPTRTLIEFIjIiRFSQDNA " 



Sequence 1938 BP; 603 A; 386 C; 470 G; 479 T; 0 other; 

gtctggccgt gagacgtttc gggagccgga gtctctccac cgcagacatg acgaagggcc 60

ttgttttagg aatctattcc aaagaaaaag aagatgatgt gccacagttc acaagtgcag 120 

gagagaattt tgataaattg ttagctggaa agctgagaga gactttgaac atatctggac 180 

cacctctgaa ggcagggaag actcgaacct tttatggtct gcatcaggac ttccccagcg 240 

tggtgctagt tggcctcggc aaaaaggcag ctggaatcga cgaacaggaa aactggcatg 300 

aaggcaaaga aaacatcaga gctgctgttg cagcggggtg caggcagatt caagacctgg 360

agctctcgtc tgtggaggtg gatccctgtg gagacgctca ggctgctgcg gagggagcgg 420

tgcttggtct ctatgaatac gatgacctaa agcaaaaaaa gaagatggct gtgtcggcaa 480

agctctatgg aagtggggat caggaggcct ggcagaaagg agtcctgttt gcttctgggc 540

agaacttggc acgccaattg atggagacgc cagccaatga gatgacgcca accagatttg 600

ccgaaattat tgagaagaat ctcaaaagtg ctagtagtaa aaccgaggtc catatcagac 660 

ccaagtcttg gattgaggaa caggcaatgg gatcattcct cagtgtggcc aaaggatctg 720 

acgagccccc agtcttcttg gaaattcact acaaaggcag ccccaatgca aacgaaccac 780 

ccctggtgtt tgttgggaaa ggaattacct ttgacagtgg tggtatctcc atcaaggctt 840 

ctgcaaatat ggacctcatg agggctgaca tgggaggagc tgcaactata tgctcagcca 900

tcgtgtctgc tgcaaagctt aatttgccca ttaatattat aggtctggcc cctctttgtg 960

aaaatatgcc cagcggcaag gccaacaagc cgggggatgt tgttagagcc aaaaacggga 1020

agaccatcca ggttgataac actgatgctg aggggaggct catactggct gatgcgctct 1080

gttacgcaca cacgtttaac ccgaaggtca tcctcaatgc cgccacctta acaggtgcca 1140

tggatgtagc tttgggatca ggtgccactg gggtctttac caattcatcc tggctctgga 1200 

acaaactctt cgaggccagc attgaaacag gggaccgtgt ctggaggatg cctctcttcg 1260 

aacattatac aagacaggtt gtagattgcc agcttgctga tgttaacaac attggaaaat 1320

acagatctgc aggagcatgt acagctgcag cattcctgaa agaattcgta actcatccta 1380

agtgggcaca tttagacata gcaggcgtga tgaccaacaa agatgaagtt ccctatctac 1440

ggaaaggcat gactgggagg cccacaagga ctctcattga gttcttactt cgtttcagtc 1500 

aagacaatgc ttagttcaga tactcaaaaa tgtcttcact ctgtcttaaa ttggacagtt 1560 

gaacttaaaa ggtttttgaa taaatggatg aaaatctttt aacggagaca aaggatggta 1620 

tttaaaaatg tagaacacaa tgaaatttgt atgccttgat ttttttttca tttcacacaa 1680

agatttataa aggtaaagtt aatatcttac ttgataagga tttttaagat actctataaa 1740 

tgattaaaat ttttagaact tcctaatcac ttttcagagt atatgttttt cattgagaag 1800 

caaaattgta actcagattt gtgatgctag gaacatgagc aaactgaaaa ttactatgca 1860

cttgtcagaa acaataaatg caacttgttg tgctcaaaaa aaaaaaaaaa aaaaaaaaaa 1920
aaaaaaaaaa aaaaaaaa



Homo sapiens mRNA for protein phosphatase 4 regulatory subunit 2 (PPP4R2



/translation="MCQAPCWRAGGSGLGRCSLCRSCSLARFPRLPSFPPPGRLRAGVC
AREGEGVGGVGGGVPVPKRPAEGGGGCEGLREAMDVEMjQEAIiKDFEKRGKKEVCPVIiD 
QFLCHVAKTGETMIQWSQPKGYFIFKLEKVMDDFRTSAPEPRGPPNPNVEYIPFDEMKE 
RILKIVTGFNGIPFTIQRIiCELIiTDPRRNYTGTDKFLRGVEKNVMVArSCVxPSSERNMS 
NSLNRMNGVMFPGNAPSYTERSNINGPGTPRPRNRPKVSLSAPMTTNGWPESTDSKEAN 



TTSSEISSVMVGETEASSSSQDKDKDSRCTRQHCTEEDEEEDEEEEEESFMTSREM1PE 
RKNQEKESDDALTVNEETSEENNQMEESDVSQAEKDLLHSEGSENEGPESKWFF" 



SQ Sequence 2049 BP; 651 A; 409 C; 506 G; 483 T; 0 other;

actgtacaaa tgctttattt ctattcaata tttagaagac agttataaac aagatgcatt 60 
caatagcatg gtggcagatg aacatcagga aggaacatcc atgagcttcc atccacggaa 120 
cctcaccatg gatacgcttg tgatcaaggg cctggtctcc cctcaagaca cggtcacaga 180 
tcagaggcca caccatccta gcagtggagc agtaccagct gggacagggt ccttctgtga 240 
cacctgctgc atcaccaggc tgggtgaacg gacacaattg ccagaactca cagaatagaa 300 
gtatcagcac cgaaacctca caggaaaaat ggtaagttct aagtttctcc attaatagta 360 
actctcagat taatctctgt catccatcgc ttctccaaga aatgactttt tagggtgatg 420 
tgccaggcgc catgttggag ggctggtggt agcggcttgg ggaggtgctc actctgtcgg 480 
tcttgctctc tcgcacgctt cccccggctc ccttcgtttc ccccccccgg tcgcctgcgt 540

gccggagtgt gtgcgaggga gggggagggc gtcggggggg tggggggagg cgttccggtc 600 
cccaaaagac ccgcggaggg aggcggaggc tgtgagggac tccgggaagc catggacgtc 660 
gagaggctcc aggaggcgct gaaagatttt gagaagaggg ggaaaaagga agtttgtcct 720 
gtcctggatc agtttctttg tcatgtagcc aagactggag aaacaatgat tcagtggtcc 780 
caatttaaag gctattttat tttcaaactg gagaaagtga tggatgattt cagaacttca 840 
gctcctgagc caagaggtcc tcccaaccct aatgtcgaat atattccctt tgatgaaatg 
aaggaaagaa tactgaaaat tgtcactgga tttaatggta tcccttttac tattcagcga 
ctatgtgaat tgttaacaga tccaaggaga aactatacag gaacagacaa atttctcaga 1020 
?™f! a ^ a 9 aac 9 tgat 33ttgttagc tgtgtttatc ottcttcaga gagaaacaat 1080 
tccaatagtt taaatcgaat gaatggtgtg atgtttcctg gaaatgcacc aagctatact 
gagaggtcta atataaatgg gcctgggaca cccaggccac gtaatcgacc aaaggtttct 
ctgtcagccc ccatgacaac aaatgggtgg cctgagagca cagacagcaa agaggcaaat 
ttgcagcaaa atgaggagaa aactcacagt gactcttcga catctgaatc agaagtttcc 1320 
«!o^ 9a9CC cttt 9 agaaa taaacatcca gatgaagatg ctgtggaagc tgaggggcat 1380 
gaggtaaaaa gactcaggtt tgacaaagaa ggtgaagtca gagaaacagc cagtcaaacg 1440 
acttccagcg aaatttcttc agttatggta ggagaaacag aagcatcatc ttcatctcag 1500

gataaagaca aagatagccg ttgtacccgg cagcactgta cagaagagga tgaagaagag 1560 
gatgaagagg aagaagaaga gtcttttatg acatcaagag aaatgatccc agaaagaaaa 1620 
aatcaagaaa aagaatctga tgatgcctta actgtgaatg aagagacttc tgaagaaaat 1680 
aatcaaatgg aggaatctga tgtgtctcaa gctgagaaag atttgctaca ttctgaaggt 1740 
S! aaaaC9 aa 9? ccct 9 a aagtaagtgg ttcttctgac tgccgtgaaa cagaaaaatt 1800 
agtaggaacc aattcccagt aaaactggaa agaatctttc cagaatcatc ccatggataa 1860
tgatgacgaa gccacagaag tcaccgatga accactggaa caagactatt tagaaacatt 1920 
tacatgcagt attttacaca cagttctggt tttaacactg tataaaactt ttatgtaaaa 1980 
aagtgcacct ttagttttac aagtaaagca ggttgtaaaa taaagtactt tatggataat 2040 



900 
960 



1140 
1200 
1260 




Human mRNA for (2'-5') oligo A synthetase E (1,6 kb RNA)

/translation="MMDLRNTPAKSLDKFIEDYLLPDTCFRMQIDHAIDIICGFLKERC
FRGSSYPVCVSKVVKGGSSGKGTTLRGRSDADLVVFLSPLTTFQDQLNRRGEFIQEIRR
QLEACQRERALSVKFEVQAPRWGNPRALSFVLSSLQLGEGVEFDVLPAFDALGQLTGSY
KPNPQIYVKLIEECTDLQKEGEFSTCFTELQRDFLKQRPTKLKSLIRLVKHWYQNCKKK
LGKLPPQYALELLTVYAWERGSMKTHFNTAQGFRTVLELVINYQQLCIYWTKYYDFKNP
IIEKYLRRQLTKPRPVILDPADPTGNLGGGDPKGWRQLAQEAEAWLNYPCFKNWDGSPV
SSWILLVRPPASSLPFIPAPLHEA"



Sequence 1322 BP; 334 A; 353 C; 320 G; 315 T; 0 other; 
gaggcagttc tgttgccact ctctctcctg tcaatgatgg atctcagaaa taccccagcc 60
aaatctctgg acaagttcat tgaagactat ctcttgccag acacgtgttt ccgcatgcaa 120
atcgaccatg ccattgacat catctgtggg ttcctgaagg aaaggtgctt ccgaggtagc 180
tcctaccctg tgtgtgtgtc caaggtggta aagggtggct cctcaggcaa gggcaccacc 240
ctcagaggcc gatctgacgc tgacctggtt gtcttcctca gtcctctcac cacttttcag 300
gatcagttaa atcgccgggg agagttcatc caggaaatta ggagacagct ggaagcctgt 360
caaagagaga gagcactttc cgtgaagttt gaggtccagg ctccacgctg gggcaacccc 420
cgtgcgctca gcttcgtact gagttcgctc cagctcgggg agggggtgga gttcgatgtg 480
ctgcctgcct ttgatgccct gggtcagttg actggcagct ataaacctaa cccccaaatc 540
tatgtcaagc tcatcgagga gtgcaccgac ctgcagaaag agggcgagtt ctccacctgc 600
ttcacagaac tacagagaga cttcctgaag cagcgcccca ccaagctcaa gagcctcatc 660
cgcctagtca agcactggta ccaaaattgt aagaagaagc ttgggaagct gccacctcag 720
tatgccctgg agctcctgac ggtctatgct tgggagcgag ggagcatgaa aacacatttc 780
aacacagccc aaggatttcg gacggtcttg gaattagtca taaactacca gcaactctgc 840
atctactgga caaagtatta tgactttaaa aaccccatta ttgaaaagta cctgagaagg 900
cagctcacga aacccaggcc tgtgatcctg gacccggcgg accctacagg aaacttgggt 960
ggtggagacc caaagggttg gaggcagctg gcacaagagg ctgaggcctg gctgaattac 1020
ccatgcttta agaattggga tgggtcccca gtgagctcct ggattctgct ggtgagacct 1080
cctgcttcct ccctgccatt catccctgcc cctctccatg aagcttgaga catatagctg 1140
gagaccattc tttccaaaga acttacctct tgccaaaggc catttatatt catatagtga 1200
caggctgtgc tccatatttt acagtcattt tggtcacaat cgagggtttc tggaattttc 1260
acatcccttg tccagaattc attcccctaa gagtaataat aaataatctc taacaccaaa 1320



DE Homo sapiens A-kinase anchoring protein 18 beta mRNA, complete cds.

FT /translation="MGQLCCFPFSRDEGKISELESSSSAVLQRYSKDIPSWSSGEKNGG

FT EPDDAELVRLSKRLVENAVLKAVQQYLEETQNKNKPGEGSSVKTEAADQNGNDNENNRK"
XX

SQ Sequence 463 BP; 139 A; 106 C; 132 G; 86 T; 0 other;

gctcgcagac tgtgctataa actgcaattt ctatttgggg tcctcacgga gaagaacacc 60 

aggaaagaca gacaggacca gtgccatggg ccagctttgc tgctttcctt tctcaagaga 120 

tgaaggaaaa atcagtgagt tggaaagctc gtcctctgca gtcctacaaa gatacagcaa 180 

ggatataccc agttggtcaa gtggtgaaaa gaacggaggg gagcccgatg acgctgaact 240 

agtaaggctc agtaagaggc tggtggagaa cgcggtgctc aaggctgtcc agcagtatct 300 

ggaggaaaca cagaataaaa acaagccggg ggaggggagc tctgtgaaaa ccgaagcagc 360 

w t t 9f tca 9 aat ggcaatgaca atgagaacaa caggaaatga gcccggaacg caggccccca 420 

tgtctctgtg caaagcctcc ctgcttccct ctgctgagtc tag 




Homo sapiens peptidyl prolyl isomerase H (cyclophilin H), mRNA (cDNA clone

/translation="MAVANSSPVNPVVFFDVSIGGQEVGRMKIELFADVVPKTAENFRQ
FCTGEFRKDGVPIGYKGSTFHRVIKDFMIQGGDFVNGDGTGVASIYRGPFADENFKLRH
SAPGLLSMANSGPSTNGCQFFITCSKCDWLDGKHVVFGKIIDGLLVMRKIENVPTGPNN
KPKLPVVISQCGEM"

Sequence 765 BP; 199 A; 156 C; 200 G; 210 T; 0 other; 
cttctgcttc cgggtcggag ccatggcggt ggcaaattca agtcctgtta accccgtggt 60
gttctttgat gtcagtattg gcggtcagga agttggccgc atgaagatcg agctctttgc 120
agacgttgtg cctaagacgg ccgagaactt taggcagttc tgcaccggag aattcaggaa 180
agatggggtt ccaataggat acaaaggaag caccttccac agggtcataa aggatttcat 240
gattcagggt ggagattttg ttaatggaga tggtactgga gtcgccagta tttaccgggg 300
gccatttgca gatgaaaatt ttaaacttag acactcagct ccaggcctgc tttccatggc 360
gaacagtggt ccaagtacaa atggctgtca gttctttatc acctgctcta agtgcgattg 420
gctggatggg aagcatgtgg tgtttggaaa aatcatcgat ggacttctag tgatgagaaa 480
gattgagaat gttcccacag gccccaacaa taagcccaag ctacctgtgg tgatctcgca 540
gtgtggggag atgtagtcca gacaaagact gaatcaggcc ttcccttctt cttggtggtg 600
ttcttgagta agataatctg gactggcccc cgtctttgct tccctgcctg ctgctgcccc 660
atttgatcaa gagaccatgg aagtgtcaga gattcagaat ccaagattgt ctttaagttt 720
:aactgtaa ataaagtttt tttgtatgcg taaaaaaaaa aaaaa



Homo sapiens mRNA; cDNA DKFZp564C0362 (from clone DKFZp564C0362); complete cds.

FT /translation="IWGKGKSNSSAVPSDSQAREKL^
FT IRWEKNITLGEPPGFLHSWWCVFWDLYCAAPERRETCEHSSEAKAFH^
FT GNIPPGDGMPVGPVPPGFFQPFMSPRYPGGPRPPLRIPNQALGGVPGSQPLLPRGMDPT
FT RQQGHPNMGGPMQRMTPPRGIWPLGPQNYGGAMRPPLNi^GGPGMPGMNMGPG
FT NPTNANSIPYSSASPGNYVGPPGGGGPPGTPIMPSPADSTNSGDNMYTLMNAVPPGPNR
FT PNFPMGPGSDGPMGGLGGMESHHMNGSLGSGDMDSISKNSPNNMSLSNQPGTPRDDGEM
FT GGNFLNPFQSESYSPSMTMSV"
FT polyA_signal 1685..1690
FT polyA_site 1711
XX
SQ Sequence 1731 BP; 513 A; 385 C; 392 G; 441 T; 0 other;
... ggcacaagca 60
... gcccgggaga 120
... agctcagaaa tcaoctcaaa 180
... gggggaacca ccaggattct 240
... agctccagag agacgtgaaa 300
... cagtgctgca gcagctccca 360
... agtaggtcct gtaccaccag 420
... tccaaggccc ccattgagga 480
... attactcccc agaggaatgg 540
... aatgcagaga atgactcctc 600
... tgcaatgaga cccccactga 660
... tccaggtggt ggtagacctt 720
... agcatctcct gggaattatg 780
... catgcctagt ccagcagatt 840
... agtacctcct ggacctaaca 900
... gggtggatta ggaggaatgg 960
... ggacagtatt tccaagaatt 1020
... aagggatgat ggcgaaatgg 1080
... ccctagcatg acaatgagcg 1140
... gtcagccctt cacagaacta 1200
... aaggaatctc agtcacacca 1260
... aagaaagcgg gtccagatgt 1320
... taaactgaac tgcaaataat 1380
... atatgtgtat gttatacaga 1440
... atattatcac atgacatctt 1500
... cacgtttaca ttttaattat 1560
... gtgtatatca ctgctttata 1620
... tatatgtctg caatcatgga 1680
taggaataaa atcgcttatt tgagagcttt caaaaaaaaa aaaaaaaaaa c



Human interferon-induced cellular resistance mediator protein (MxB) mRNA,
complete cds.

/translation="MSKAHKPWPYRRRSQFSSRKVljKKEMNSFQQQPPPFGTVPPQMMF 

pPNWQGAEKDAAFLAKDFNFLTLNNQPPPGm^ 
DSLRALGVEQDIiALPAXAVIGDQSSGKSSVLEALSGVJ^ 

CEAWAGRI S YKNTEIiELQDPGQVEKEIHKAQNVMAGNGRG I SHELI SLE I TS PEVPDLT 

IIDLPGITRVAVDNQPRDIGIjQIKALIKKYIQRQQTIN^ 

VDPEGDRTIGILTKPDLMDRGTEKSVMNVVRNLTYPLKKGYMIW 

EATKKEITFFQTHPYFRVLLEEGSATVPRLAERIjTTEIjIMHIQKSIjPLLEGQIRESHQK 
ATEELRRCGADIPSQEADKMFFLIEKIKMFNQDIEKLVEGEEWRENETRIiYNKIREDF 

KNWGILATNTQKVKNIIHEEVEKYEKQ^ 

SMLQKAI^IIQQAFINVAKKHFGEFFNLNQWQSTIEDIKVKHTAKAENMIQLQ 
MVFCQDQIY S VVLKKVREBIFNPIiGTPSQNMKIjNSHFPSNES SVS SFTEIGIHIjNAYFL 
ETSKRLANQIPFIIQYFMIiRENGDSIjQKAN^^ 
RI YRLTQARHALCQFS SKEIH " 



Sequence 2961 BP; 826 A; 754 C; 721 G; 660 T; 0 other;
aagagatgat ttctccatcc tgaacgtgca gcgagcttgt caggaagatc ggaggtgcca 60
agtagcagag aaagcatccc ccagctctga cagggagaca gcacatgtct aaggcccaca 120
agccttggcc ctaccggagg agaagtcaat tttcttctcg aaaatacctg aaaaaagaaa 180
tgaattcctt ccagcaacag ccaccgccat tcggcacagt gccaccacaa atgatgtttc 240
ctccaaactg gcagggggca gagaaggacg ctgctttcct cgccaaggac ttcaactttc 300
tcactttgaa caatcagcca ccaccaggaa acaggagcca accaagggca atggggcccg 360
agaacaacct gtacagccag tacgagcaga aggtgcgccc ctgcattgac ctcatcgact 420
ccctgcgggc tctgggtgtg gagcaggacc tggccctgcc agccatcgcc gtcatcgggg 480
accagagctc gggcaagagc tctgtgctgg aggcactgtc aggagtcgcg cttcccagag 540
gcagcggaat cgtaaccagg tgtccgctgg tgctgaaact gaaaaagcag ccctgtgagg 600
catgggccgg aaggatcagc taccggaaca ccgagctaga gcttcaggac cctggccagg 660
tggagaaaga gatacacaaa gcccagaacg tcatggccgg gaatggccgg ggcatcagcc 720
atgagctcat cagcctggag atcacctccc ctgaggttcc agacctgacc atcattgacc 780
ttcccggcat caccagggtg gctgtggaca accagccccg agacatcgga ctgcagatca 840
aggctctcat caagaagtac atccagaggc agcagacgat caacttggtg gtggttccct 900
gtaacgtgga cattgccacc acggaggcgc tgagcatggc ccatgaggtg gacccggaag 960
gggacaggac catcggtatc ctgaccaaac cagatctaat ggacaggggc actgagaaaa 1020
gcgtcatgaa tgtggtgcgg aacctcacgt accccctcaa gaagggctac atgattgtga 1080
agtgccgggg ccagcaggag atcacaaaca ggctgagctt ggcagaggca accaagaaag 1140
aaattacatt ctttcaaaca catccatatt tcagagttct cctggaggag gggtcagcca 1200
cggttccccg actggcagaa agacttacca ctgaactcat catgcatatc caaaaatcgc 1260
tcccgttgtt agaaggacaa ataagggaga gccaccagaa ggcgaccgag gagctgcggc 1320
gttgcggggc tgacatcccc agccaggagg ccgacaagat gttctttcta attgagaaaa 1380
tcaaaatgtt taatcaggac atcgaaaagt tagtagaagg agaagaagtt gtaagggaga 1440
atgagacccg tttatacaac aaaatcagag aggattttaa aaactgggta ggcatacttg 1500
caactaatac ccaaaaagtt aaaaatatta tccacgaaga agttgaaaaa tatgaaaagc 1560
agtatcgagg caaggagctt ctgggatttg tcaactacaa gacatttgag atcatcgtgc 1620
atcagtacat ccagcagctg gtggagcccg cccttagcat gctccagaaa gccatggaaa 1680
ttatccagca agctttcatt aacgtggcca aaaaacattt tggcgaattt ttcaacctta 1740
accaaactgt tcagagcacg attgaagaca taaaagtgaa acacacagca aaggcagaaa 1800
acatgatcca acttcagttc agaatggagc agatggtttt ttgtcaagat cagatttaca 1860
gtgttgttct gaagaaagtc cgagaagaga tttttaaccc tctggggacg ccttcacaga 1920
atatgaagtt gaactctcat tttcccagta atgagtcttc ggtttcctcc tttactgaaa 1980
taggcatcca cctgaatgcc tacttcttgg aaaccagcaa acgtctcgcc aaccagatcc 2040
catttataat tcagtatttt atgctccgag agaatggtga ctccttgcag aaagccatga 2100
tgcagatact acaggaaaaa aatcgctatt cctggctgct tcaagagcag agtgagaccg 2160
ctaccaagag aagaatcctt aaggagagaa tttaccggct cactcaggcg cgacacgcac 2220
tctgtcaatt ctccagcaaa gagatccact gaagggcggc gatgcctgtg gttgttttct 2280
tgtgcgtact cattcattct aaggggagtc ggtgcaggat gccgcttctg ctttggggcc 2340
aaactcttct gtcactatca gtgtccatct ctactgtact ccctcagcat cagagcatgc 2400
atcaggggtc cacacaggct cagctctctc caccacccag ctcttccctg accttcacga 2460
agggatggct ctccagtcct tgggtcccgt agcacacagt tacagtgtcc taagatactg 2520
ctatcattct tcgctaattt gtatttgtat tcccttcccc ctacaagatt atgagacccc 2580
agagggggaa ggtctgggtc aaattcttct tttgtatgtc cagtctcctg cacagcacct 2640
gcagcattgt aactgcttaa taaatgacat ctcactgaac gaatgagtgc tgtgtaagtg 2700
atggagatac ctgaggctat tgctcaagcc caggccttgg acatttagtg actgttagcc 2760
ggtccctttc agatccagtg gccatgcccc ctgcttccca tggttcactg tcattgtgtt 2820
tcccagcctc tccactcccc cgccagaaag gagcctgagt gattctcttt tcttcttgtt 2880
tccctgatta tgatgagctt ccattgttct gttaagtctt gaagaggaat ttaataaagc 2940
aaagaaactt tttaaaaacg t

Gene Screen 

The invention relates to a screen for the identification of genes which show regulated 
expression in response to carbon source utilisation. 

Colorectal cancer is a cancer which occurs in the large intestine and rectum. The
colon can be divided into four sections: the ascending colon, the transverse colon,
the descending colon and the sigmoid colon. Most colorectal cancers arise in the
sigmoid colon and develop from "polyps" which can grow for several years before
becoming cancerous. The early detection of these pre-cancerous growths is desirable
since removal of the polyps is a very effective means to stem the progress of disease.

There are various types of colorectal cancer. Most cancers of this type are
adenocarcinomas, which are malignant growths which begin in the epithelial cells
which line the colon and rectum. Other cancers of the colon and rectum include
gastrointestinal stromal tumours and lymphomas. In some examples the patient can
be asymptomatic and for this reason it is important that screening is undertaken to
identify those patients in which pre-cancerous polyps are forming. However, some
patients do present with symptoms and these include rectal bleeding, diarrhoea,
constipation, abdominal pain, and general weakness.

As mentioned above, regular screening is by far the most effective way of controlling
this disease since removal of pre-cancerous polyps by surgery can effectively cure
any disease before it is initiated. Currently, diagnostic tests include colonoscopy,
which allows a doctor to examine the rectum and colon; faecal blood analysis to check
for any bleeding from the bowel and rectal area, although this test is not directly
diagnostic for cancerous lesions in its own right; and sigmoidoscopy, which is similar
to colonoscopy but only investigates the lower bowel area.

Typically, patients with a family history of colorectal cancer can be expected to have
a colonoscopy every 5 years or so and a blood stool check on a yearly basis from
about the age of 40.

The treatment of colorectal cancer usually involves invasive surgery to remove
polyps and/or malignant growths. If the cancer has developed beyond the polyp stage
then more extensive surgery is required which can result in removal of part of the
bowel and surrounding lymph nodes. In the situation where a cancer necessitates
extensive surgery a colostomy stoma may be required, at least for a period, to allow
the bowel to recover from surgery. Surgery in the rectal region is more complicated
and is largely dependent on how far the disease has progressed. In some cases the
surgery can damage nerves which control sexual and urinary functions. In advanced
stage colorectal cancers metastatic lesions may require removal and in about 15% of
cases the lesions are in the liver, which requires removal of large parts of the liver.
The surgical removal of polyps and/or cancerous growths leads to a good prognosis
for patients. In some cases surgery is followed by a course of chemotherapy (for
colon cancer) or chemotherapy and radiation therapy (for rectal cancer) to remove any
cancer cells not detected during surgery. The chemotherapeutic agents typically used
to treat colorectal cancer include 5-fluorouracil, leucovorin, irinotecan and
capecitabine.

It is apparent that the early detection of cells which are pre-cancerous is highly 
desirable since in most cases surgery to remove these cells results in a very good 
prognosis for patients. Diagnostic tests which use the detection of cancer markers as 
an early indicator of cancer are known in the art. 

For example, EP1355149 describes gene expression profiles from colorectal samples
to provide a "fingerprint" expression profile as an indication of whether a patient is
susceptible to the development of colorectal cancer or indeed if malignant growth has
already been initiated. The disclosure in EP1355149 is directed to the use of
microarrays to compare transformed and non-transformed tissue gene expression in a
global sense.

WO02/059609 also describes a gene screen which utilises expression profiles in
breast and colorectal cancer. A comparison is made between "normal" and
"abnormal" samples in patients to provide a global picture of gene expression in
these samples as an indicator of particular genes which are either over-expressed or
abrogated between samples. Both EP1355149 and WO02/059609 take a shotgun
approach to screening for target genes which can be used either as a diagnostic tool
or as a target for the development of new chemotherapeutic agents.

The present invention provides a targeted screen for genes the expression of which
may be altered in response to carbon source. The invention makes use of the
differences in expression profiles between normal and diseased tissue as a
consequence of differences in metabolic state between cancer cells and normal cells
due in part to carbon source utilisation by these respective cell types. The epithelial
cells which line the colon and rectum metabolise butyrate as a carbon source for
energy transduction via glycolysis. The main carbon source utilised by tumour cells
is glucose. Consequently, expression profiles between these cell types are different
due to the differences in carbon source metabolism.

We have identified a large number of potential markers of colorectal cancer which
have utility with respect to the early diagnosis of disease and as targets for the
development of novel chemotherapeutic agents. Moreover, this assay has broader
applicability to conditions resulting from dysfunction of the bowel (e.g. colitis,
ulcerative colitis, diversion colitis, Crohn's disease and irritable bowel syndrome).
In addition, the assay provides a screening tool for fibre consumption and an assay
for colon microflora functionality (the effectiveness of fermentation of specific
fibres).

According to an aspect of the invention there is provided a method to screen for
nucleic acid molecules which show altered expression in an isolated first cell sample
comprising comparing the gene expression profile of said first cell sample with that
of a second reference cell sample, wherein said first cell sample has been grown in
the presence of the carbon source butyrate, or a related carbon source from which
butyrate is derived, either directly or indirectly, and wherein said second reference
cell sample has not been grown in the presence of butyrate, or said related carbon
source.

According to a further aspect of the invention there is provided a method to screen 
for nucleic acid molecules which show altered expression in an isolated biological 
sample comprising the steps of: 

i) providing 

a) a cell growth preparation comprising a first cell sample derived 
from at least one region of the colon; cell growth media; and a 
carbon source wherein said carbon source is butyrate; and 

b) a cell growth preparation comprising a second cell sample derived
from an equivalent region of the colon; cell growth media; and a
carbon source which is not butyrate;

ii) extracting nucleic acid from said first and second cell samples; and 

iii) comparing the gene expression profile in said first cell sample with 
the gene expression profile in said second cell sample. 
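
The comparison in step iii) can be illustrated with a minimal Python sketch. It assumes that each expression profile has already been reduced to one numeric value per gene; the gene names, the values, the pseudocount and the two-fold threshold are all hypothetical choices made for illustration and are not taken from the specification.

```python
import math


def log2_fold_changes(butyrate, reference, pseudocount=1.0):
    """Return log2(butyrate / reference) for genes measured in both samples.

    The pseudocount (an assumption of this sketch) avoids division by zero.
    """
    shared = butyrate.keys() & reference.keys()
    return {
        gene: math.log2(
            (butyrate[gene] + pseudocount) / (reference[gene] + pseudocount)
        )
        for gene in shared
    }


def altered_genes(butyrate, reference, threshold=1.0):
    """List genes whose absolute log2 fold change meets the threshold."""
    changes = log2_fold_changes(butyrate, reference)
    return sorted(g for g, lfc in changes.items() if abs(lfc) >= threshold)


# Hypothetical expression values in arbitrary units, for illustration only.
butyrate_grown = {"GENE_A": 7.0, "GENE_B": 15.0, "GENE_C": 3.0}
glucose_grown = {"GENE_A": 1.0, "GENE_B": 15.0, "GENE_C": 15.0}

print(altered_genes(butyrate_grown, glucose_grown))
```

A real analysis would normally add normalisation across samples and a statistical test over replicates; this sketch only shows the shape of the comparison.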



In a preferred method of the invention said first and second cell samples are 
derived from the ascending colon. 

In an alternative preferred method of the invention said first and second cell 
samples are derived from the transverse colon. 

In a further preferred method of the invention said first and second samples 
are derived from the descending colon. 

In a still further preferred method of the invention said first and second
samples are derived from the sigmoid region of the colon. Preferably said
cell samples are derived from the rectal region of the colon.

In a further preferred method of the invention said first and second cell
samples comprise epithelial cells.

In a preferred method of the invention said carbon source which is not 
butyrate is glucose. 

In a still further preferred method of the invention said nucleic acid molecule
which shows altered expression is selected from the group as represented by
the nucleic acid sequences shown in Table 1, or nucleic acid molecules
which hybridise to the sequences presented in Table 1. Preferably said nucleic
acid molecules hybridise under stringent hybridisation conditions.

According to a further aspect of the invention there is provided a method for 
the detection of at least one nucleic acid molecule associated with the 
initiation and/or progression of colorectal cancer, in an animal, comprising 
the steps of: 

i) providing a biological sample comprising at least one cell to 
be tested; 

ii) contacting said sample with a ligand which binds at least one 
nucleic acid molecule as represented by the nucleic acid 
sequence selected from the group consisting of: 

a) a nucleic acid molecule as represented by the nucleic 
acid sequence as shown in Table 1; 

b) a nucleic acid molecule which hybridises to nucleic 
acid molecules as defined in (a); 

c) a nucleic acid molecule that is degenerate as a 
consequence of the genetic code to the nucleic acid 
molecule represented in (a) and (b); 

iii) detecting the presence of at least one nucleic acid molecule in 
said sample. 



In a preferred method of the invention said animal is human. 

In a further preferred method of the invention said colorectal cancer is
adenocarcinoma.

In a preferred method of the invention said ligand is a nucleic acid molecule adapted 
to anneal to said nucleic acid molecule which is indicative of colorectal cancer. 

It will be apparent to the skilled person that a number of nucleic acid based assay
systems are available which can be adapted to detect nucleic acid molecules as
herein disclosed, for example quantitative polymerase chain reaction assays, in situ
hybridisation and northern blotting.
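
As an illustration of how a quantitative polymerase chain reaction readout might be turned into a relative expression value, the following Python sketch uses the widely used 2^-ΔΔCt calculation. The specification does not state which analysis is used, so the choice of this calculation, the housekeeping gene normalisation and every cycle threshold (Ct) value below are assumptions made for illustration.

```python
def delta_ct(ct_target, ct_housekeeping):
    """Normalise the target gene Ct against a housekeeping gene Ct."""
    return ct_target - ct_housekeeping


def relative_expression(test_target, test_housekeeping,
                        control_target, control_housekeeping):
    """Fold change of the target gene in the test sample versus the control.

    Assumes roughly 100% amplification efficiency, i.e. the amount of
    product doubles with each cycle.
    """
    ddct = (delta_ct(test_target, test_housekeeping)
            - delta_ct(control_target, control_housekeeping))
    return 2.0 ** (-ddct)


# Hypothetical Ct values: the target crosses threshold three cycles earlier
# in the test sample, with identical housekeeping Ct values in both samples.
fold = relative_expression(test_target=22.0, test_housekeeping=18.0,
                           control_target=25.0, control_housekeeping=18.0)
print(fold)
```

A value above 1 indicates higher expression in the test sample than in the control, and a value below 1 indicates lower expression.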

According to a further aspect of the invention there is provided a method for the
detection of at least one polypeptide associated with the initiation and/or progression
of colorectal cancer, in an animal, comprising the steps of:

i) providing a biological sample comprising at least one cell to be tested; 

ii) contacting said sample with at least one ligand which ligand 
specifically binds at least one polypeptide encoded by a nucleic acid 
molecule as represented by the nucleic acid sequence shown in Table 
1, or a variant polypeptide comprising an amino acid sequence which 
varies by the addition, deletion or substitution of at least one amino 
acid residue; and 

iii) detecting the presence of at least one polypeptide in said sample.

In a preferred method of the invention said animal is human.

In a further preferred embodiment of the invention said ligand is an antibody,
preferably a monoclonal antibody, or at least the effective binding part thereof.

Methods which utilise antibodies to detect the presence of a polypeptide in a
biological sample are well known in the art and include ELISAs, western blotting and
immunofluorescence.

According to a further aspect of the invention there is provided the use of at least one
polypeptide, or variant sequence thereof, encoded by a nucleic acid molecule(s) as
represented by the nucleic acid sequences as shown in Table 1, as a target for the
screening of agents which modulate the activity of said polypeptide.

According to a yet further aspect of the invention there is provided a method to
screen for agents which modulate the activity of at least one gene associated with the
initiation and/or progression of colorectal cancer comprising the steps of:

i) forming a preparation comprising at least one polypeptide wherein
said polypeptide is encoded by a nucleic acid molecule as represented
by the nucleic acid sequence as shown in Table 1, or a variant
polypeptide comprising an amino acid sequence which varies by the
addition, deletion or substitution of at least one amino acid residue as
represented by the amino acid sequences shown in Table 1, and at
least one agent to be tested; and

ii) determining the activity of said agent with respect to activity of said
polypeptide.

In a preferred method of the invention said polypeptide is expressed by a cell wherein
said cell is transformed or transfected with said nucleic acid molecule. Preferably
said nucleic acid molecule is part of a vector adapted for recombinant expression of
said nucleic acid molecule. Preferably said vector is provided with a promoter which
enables the expression of said nucleic acid molecule to be regulated.

In a preferred method of the invention said cell is derived from the colon, preferably
said cell is an epithelial cell which lines said colon.

In a further preferred method of the invention said agent is an antibody, preferably a
monoclonal antibody or modified antibody, or at least the effective binding part
thereof.



Antibodies, also known as immunoglobulins, are protein molecules which usually
have specificity for foreign molecules (antigens). Immunoglobulins (Ig) are a class
of structurally related proteins consisting of two pairs of polypeptide chains, one pair
of light (L) (low molecular weight) chains (κ or λ), and one pair of heavy (H) chains
(γ, α, μ, δ and ε), all four linked together by disulphide bonds. Both H and L chains
have regions that contribute to the binding of antigen and that are highly variable
from one Ig molecule to another. In addition, H and L chains contain regions that are
non-variable or constant.

The L chains consist of two domains. The carboxy-terminal domain is essentially
identical among L chains of a given type and is referred to as the "constant" (C)
region. The amino terminal domain varies from L chain to L chain and contributes to
the binding site of the antibody. Because of its variability, it is referred to as the
"variable" (V) region.



The H chains of Ig molecules are of several classes, α, μ, δ, ε and γ (of which there
are several sub-classes). An assembled Ig molecule consisting of one or more units
of two identical H and L chains derives its name from the H chain that it possesses.
Thus, there are five Ig isotypes: IgA, IgM, IgD, IgE and IgG (with four sub-classes
based on the differences in the 'constant' regions of the H chains, i.e., IgG1, IgG2,
IgG3 and IgG4). Further detail regarding antibody structure and their various
functions can be found in Using Antibodies: A Laboratory Manual, Cold Spring
Harbor Laboratory Press.
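
The naming rule described above, in which an assembled Ig molecule takes its name from its H chain class, can be written as a small lookup. This Python sketch is purely illustrative; the spelled-out class names used as keys are a convenience of the sketch.

```python
# Heavy (H) chain class -> immunoglobulin isotype, as listed in the text.
HEAVY_CHAIN_TO_ISOTYPE = {
    "alpha": "IgA",
    "mu": "IgM",
    "delta": "IgD",
    "epsilon": "IgE",
    "gamma": "IgG",
}

# The four IgG sub-classes named in the text.
IGG_SUBCLASSES = ("IgG1", "IgG2", "IgG3", "IgG4")


def isotype_for(heavy_chain_class):
    """Return the isotype name for a heavy chain class such as 'gamma'."""
    try:
        return HEAVY_CHAIN_TO_ISOTYPE[heavy_chain_class.lower()]
    except KeyError:
        raise ValueError(f"unknown heavy chain class: {heavy_chain_class!r}")


print(isotype_for("gamma"))
```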

In a preferred method of the invention said fragment is a Fab fragment.

In a further preferred method of the invention said antibody is selected from the
group consisting of: F(ab')2, Fab, Fv and Fd fragments; and antibodies comprising
CDR3 regions.

Preferably said fragments are single chain antibody variable regions (scFvs) or
domain antibodies. If a hybridoma exists for a specific monoclonal antibody it is well
within the knowledge of the skilled person to isolate scFvs from mRNA extracted
from said hybridoma via RT PCR. Alternatively, phage display screening can be
undertaken to identify clones expressing scFvs. Domain antibodies are the smallest
binding part of an antibody (approximately 13 kDa). Examples of this technology are
disclosed in US6,248,516, US6,291,158, US6,127,197 and EP0368684 which are
all incorporated by reference in their entirety.

A modified antibody, or variant antibody, and a reference antibody may differ in amino
acid sequence by one or more substitutions, additions, deletions or truncations, which
may be present in any combination. Among preferred variants are those that vary
from a reference polypeptide by conservative amino acid substitutions. Such
substitutions are those that substitute a given amino acid by another amino acid of
like characteristics. The following non-limiting list of amino acids are considered
conservative replacements (similar): a) alanine, serine, and threonine; b) glutamic
acid and aspartic acid; c) asparagine and glutamine; d) arginine and lysine; e)
isoleucine, leucine, methionine and valine; and f) phenylalanine, tyrosine and
tryptophan. Most highly preferred are variants which show enhanced biological
activity.
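
The groups a) to f) above lend themselves to a simple membership check. The following Python sketch classifies each difference between a reference and a variant sequence as conservative or not, using only those six groups; the one-letter codes, the function names and the example sequences are illustrative assumptions, and the sketch handles substitutions only (not additions, deletions or truncations).

```python
# Groups a) to f) from the text, written in one-letter amino acid codes.
CONSERVATIVE_GROUPS = [
    set("AST"),   # a) alanine, serine, threonine
    set("ED"),    # b) glutamic acid, aspartic acid
    set("NQ"),    # c) asparagine, glutamine
    set("RK"),    # d) arginine, lysine
    set("ILMV"),  # e) isoleucine, leucine, methionine, valine
    set("FYW"),   # f) phenylalanine, tyrosine, tryptophan
]


def is_conservative(original, replacement):
    """True if both residues differ and fall within the same listed group."""
    if original == replacement:
        return False  # identical residues are not a substitution
    return any(original in g and replacement in g for g in CONSERVATIVE_GROUPS)


def classify_substitutions(reference, variant):
    """Return (position, reference residue, variant residue, conservative?)."""
    if len(reference) != len(variant):
        raise ValueError("this sketch compares equal-length sequences only")
    return [
        (i + 1, r, v, is_conservative(r, v))
        for i, (r, v) in enumerate(zip(reference, variant))
        if r != v
    ]


# Hypothetical six-residue peptides, for illustration only.
print(classify_substitutions("MAVANS", "MSVAQG"))
```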


Preferably said antibody is a humanised or chimeric antibody. 

A chimeric antibody is produced by recombinant methods to contain the variable 
region of an antibody with an invariant or constant region of a human antibody. 

A humanised antibody is produced by recombinant methods to combine the
complementarity determining regions (CDRs) of an antibody with both the constant
(C) regions and the framework regions from the variable (V) regions of a human
antibody.

Chimeric antibodies are recombinant antibodies in which all of the V-regions of a
mouse or rat antibody are combined with human antibody C-regions. Humanised
antibodies are recombinant hybrid antibodies which fuse the complementarity
determining regions from a rodent antibody V-region with the framework regions
from the human antibody V-regions. The C-regions from the human antibody are also
used. The complementarity determining regions (CDRs) are the regions within the
N-terminal domain of both the heavy and light chain of the antibody to which the
majority of the variation of the V-region is restricted. These regions form loops at the
surface of the antibody molecule. These loops provide the binding surface between
the antibody and antigen.

Antibodies from non-human animals provoke an immune response to the foreign
antibody, leading to its removal from the circulation. Both chimeric and humanised
antibodies have reduced antigenicity when injected into a human subject because there
is a reduced amount of rodent (i.e. foreign) antibody within the recombinant hybrid
antibody, while the human antibody regions do not elicit an immune response. This
results in a weaker immune response and a decrease in the clearance of the antibody.
This is clearly desirable when using therapeutic antibodies in the treatment of human
diseases. Humanised antibodies are designed to have fewer "foreign" antibody regions
and are therefore thought to be less immunogenic than chimeric antibodies.

In an alternative preferred method of the invention said agent is a polypeptide or a
peptide. Preferably said polypeptide or peptide is modified.

In a preferred method of the invention said peptide is at least 6 amino acid residues in
length. Preferably the length of said peptide/polypeptide is selected from the group
consisting of: at least 7 amino acid residues; 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18,
19, or 20 amino acid residues in length. Alternatively the length of said
peptide/polypeptide is at least 20 amino acid residues; 30; 40; 50; 60; 70; 80; 90; or
100 amino acid residues in length.

It will be apparent to one skilled in the art that modification to the amino acid
sequence of peptide agents could enhance the binding and/or stability of the peptide
with respect to its target sequence. In addition, modification of the peptide may also
increase the in vivo stability of the peptide thereby reducing the effective amount of
peptide necessary to inhibit the activity of a target polypeptide. This would
advantageously reduce undesirable side effects which may result in vivo.
Alternatively or preferably, said modification includes the use of modified amino
acids in the production of recombinant or synthetic forms of peptides. It will be
apparent to one skilled in the art that modified amino acids include, by way of
example and not by way of limitation, 4-hydroxyproline, 5-hydroxylysine,
N6-acetyllysine, N6-methyllysine, N6,N6-dimethyllysine, N6,N6,N6-trimethyllysine,
cyclohexylalanine, D-amino acids and ornithine. Other modifications include amino
acids with a C2, C3 or C4 alkyl R group optionally substituted by 1, 2 or 3 substituents
selected from halo (e.g. F, Br, I), hydroxy or C1-C4 alkoxy. Modifications also
include, by example and not by way of limitation, acetylation and amidation.

In a preferred embodiment of the invention said peptide sequence is acetylated.
Preferably said acetylation is to the amino terminus of said peptide.

In a further preferred embodiment of the invention said peptide sequence is amidated.
Preferably said amidation is to the carboxyl-terminus of said peptide.

It will also be apparent to one skilled in the art that peptides could be modified by
cyclisation. Cyclisation is known in the art (see Scott et al Chem Biol (2001),
8:801-815; Gellerman et al J. Peptide Res (2001), 57: 277-291; Dutta et al J. Peptide
Res (2000), 8: 398-412; Ngoka and Gross J Amer Soc Mass Spec (1999), 10:360-363).



In a further preferred method of the invention said agent is a nucleic acid molecule.
Preferably said nucleic acid molecule is an aptamer or a modified aptamer. In an
alternative preferred method of the invention said nucleic acid is an inhibitory RNA
(RNAi) molecule. Alternatively said nucleic acid molecule is an antisense nucleic
acid molecule.



Nucleic acids have both linear sequence structure and a three dimensional structure
which in part is determined by the linear sequence and also the environment in which
these molecules are located. Conventional therapeutic molecules are small
molecules, for example, peptides, polypeptides, or antibodies, which bind target
molecules to produce an agonistic or antagonistic effect. It has become apparent that
nucleic acid molecules also have potential with respect to providing agents with the
requisite binding properties which may have therapeutic utility. These nucleic acid
molecules are typically referred to as aptamers. Aptamers are small, usually
stabilised, nucleic acid molecules which comprise a binding domain for a target
molecule. A screening method to identify aptamers is described in US 5,270,163,
which is incorporated by reference. Aptamers are typically oligonucleotides which
may be single stranded oligodeoxynucleotides, oligoribonucleotides, or modified
oligodeoxynucleotides or oligoribonucleotides.



The term "modified" encompasses nucleotides with a covalently modified base
and/or sugar. For example, modified nucleotides include nucleotides having sugars
which are covalently attached to low molecular weight organic groups other than a
hydroxyl group at the 3' position and other than a phosphate group at the 5' position.
Thus modified nucleotides may also include 2' substituted sugars such as
2'-O-methyl-; 2'-O-alkyl; 2'-O-allyl; 2'-S-alkyl; 2'-S-allyl; 2'-fluoro-; 2'-halo or
2'-azido-ribose; carbocyclic sugar analogues; α-anomeric sugars; epimeric sugars such
as arabinose, xyloses or lyxoses; pyranose sugars; furanose sugars; and sedoheptulose.

Modified nucleotides are known in the art and include, by example and not by way of
limitation: alkylated purines and/or pyrimidines; acylated purines and/or
pyrimidines; or other heterocycles. These classes of pyrimidines and purines are
known in the art and include pseudoisocytosine; N4,N4-ethanocytosine;
8-hydroxy-N6-methyladenine; 4-acetylcytosine; 5-(carboxyhydroxylmethyl) uracil;
5-fluorouracil; 5-bromouracil; 5-carboxymethylaminomethyl-2-thiouracil;
5-carboxymethylaminomethyl uracil; dihydrouracil; inosine; N6-isopentyl-adenine;
1-methyladenine; 1-methylpseudouracil; 1-methylguanine; 2,2-dimethylguanine;
2-methyladenine; 2-methylguanine; 3-methylcytosine; 5-methylcytosine;
N6-methyladenine; 7-methylguanine; 5-methylaminomethyl uracil;
5-methoxyaminomethyl-2-thiouracil; β-D-mannosylqueosine;
5-methoxycarbonylmethyluracil; 5-methoxyuracil; 2-methylthio-N6-isopentenyladenine;
uracil-5-oxyacetic acid methyl ester; pseudouracil; 2-thiocytosine;
5-methyl-2-thiouracil; 2-thiouracil; 4-thiouracil; 5-methyluracil;
N-uracil-5-oxyacetic acid methyl ester; uracil-5-oxyacetic acid;
queosine; 2-thiocytosine; 5-propyluracil; 5-propylcytosine; 5-ethyluracil;
5-ethylcytosine; 5-butyluracil; 5-pentyluracil; 5-pentylcytosine;
2,6-diaminopurine; methylpseudouracil; 1-methylguanine; and 1-methylcytosine.

The aptamers of the invention are synthesized from conventional phosphodiester
linked nucleotides using standard solid or solution phase synthesis techniques which
are known in the art. Linkages between nucleotides may use alternative linking
molecules. For example, linking groups of the formula P(O)S (thioate); P(S)S
(dithioate); P(O)NR'2; P(O)R'; P(O)OR6; CO; or CONR'2, wherein R is H (or a salt)
or alkyl (1-12C) and R6 is alkyl (1-9C), are joined to adjacent nucleotides through
-O- or -S-. The binding of aptamers to a target polypeptide is readily testable.

An alternative nucleic acid molecule is a so called RNAi molecule. A recent
technique to specifically ablate gene function is through the introduction of double
stranded RNA, also referred to as inhibitory RNA (RNAi), into a cell, which results
in the destruction of mRNA complementary to the sequence included in the RNAi
molecule. The RNAi molecule comprises two complementary strands of RNA (a
sense strand and an antisense strand) annealed to each other to form a double
stranded RNA molecule. The RNAi molecule is typically derived from exonic or
coding sequence of the gene which is to be ablated. Recent studies suggest that
RNAi molecules ranging from 100-1000 bp derived from coding sequence are
effective inhibitors of gene expression. Surprisingly, only a few molecules of RNAi
are required to block gene expression, which implies the mechanism is catalytic. The
site of action appears to be nuclear, as little if any RNAi is detectable in the cytoplasm
of cells, indicating that RNAi exerts its effect during mRNA synthesis or processing.



In a preferred method of the invention there is provided a cassette comprising a 
nucleic acid molecule, or part thereof, wherein said molecule is selected from the 
group consisting of: 

i) a nucleic acid molecule represented by the nucleic acid sequence
shown in Table 1;

ii) a nucleic acid molecule which hybridises to the sequence in (i) above
and which encodes a polypeptide which initiates or promotes
transformation of colon cells; or

iii) a nucleic acid molecule which is degenerate because of the genetic
code to the sequences defined in (i) and (ii) above, wherein said
cassette is adapted such that both sense and antisense nucleic acid
molecules are transcribed from said cassette.

In a preferred method of the invention said cassette is provided with at least two
promoters adapted to transcribe both sense and antisense strands of said nucleic acid
molecule.

In a further preferred method of the invention said cassette comprises a nucleic acid
molecule wherein said molecule comprises a first part linked to a second part
wherein said first and second parts are complementary over at least part of their
sequence and further wherein transcription of said nucleic acid molecule produces an
RNA molecule which forms a double stranded region by complementary base pairing
of said first and second parts.

In a preferred embodiment of the invention said first and second parts are linked by at
least one nucleotide base.

In a preferred embodiment of the invention said first and second parts are linked by 
2, 3, 4, 5, 6, 7, 8, 9 or at least 10 nucleotide bases. 
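
The arrangement of a first part, a linker and a complementary second part can be illustrated with a short Python sketch. This is a minimal illustration only: the 21-base sequence and the 9-base linker are made-up placeholders, not sequences from Table 1, and the function names are hypothetical.

```python
# Sketch of a hairpin-forming insert: first part + linker + reverse complement.
# The example sequences are arbitrary placeholders, not taken from Table 1.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def hairpin_insert(first_part: str, linker: str) -> str:
    """Join a first part, a linker, and a second part complementary to the first."""
    first = first_part.upper()
    return first + linker.upper() + reverse_complement(first)


def transcribe(dna: str) -> str:
    """Convert a DNA coding-strand sequence to the corresponding RNA sequence."""
    return dna.upper().replace("T", "U")


if __name__ == "__main__":
    insert = hairpin_insert("ATGGCCAAGCTGACCTGAAGT", "TTCAAGAGA")
    print("DNA insert:", insert)
    print("Transcript:", transcribe(insert))
```

In the resulting transcript the second part can base pair with the first part, leaving the linker bases as an unpaired loop.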

In a further preferred embodiment of the invention the length of the RNAi molecule
is between 100 bp and 1000 bp. More preferably still the length of RNAi is selected from
100 bp; 200 bp; 300 bp; 400 bp; 500 bp; 600 bp; 700 bp; 800 bp; 900 bp; or 1000 bp. More
preferably still said RNAi is at least 1000 bp.

In an alternative preferred method of the invention the RNAi molecule is between 
15bp and 25bp, preferably said molecule is 21bp. Preferably said cassette is part of a 
vector. 

According to a further aspect of the invention there is provided an antibody identified
by the method according to the invention for use as a pharmaceutical.

According to a further aspect of the invention there is provided a polypeptide or
peptide identified by the method according to the invention for use as a
pharmaceutical.

According to a further aspect of the invention there is provided a nucleic acid
molecule identified by the method according to the invention for use as a
pharmaceutical.

In a preferred embodiment of the invention said nucleic acid molecule is an aptamer.

In an alternative preferred embodiment of the invention said nucleic acid molecule is
an inhibitory RNA.

In a further alternative preferred embodiment of the invention said nucleic acid
molecule is an antisense nucleic acid molecule.

In a preferred embodiment of the invention said pharmaceutical further comprises a
diluent, carrier or excipient.

When administered, the therapeutic compositions of the present invention are
administered in pharmaceutically acceptable preparations. Such preparations may
routinely contain pharmaceutically acceptable concentrations of salt, buffering
agents, preservatives, compatible carriers, supplementary immune potentiating agents
such as adjuvants and cytokines and optionally other therapeutic agents, such as
chemotherapeutic agents.

The therapeutics of the invention can be administered by any conventional route,
including injection or by gradual infusion over time. The administration may, for
example, be oral, intravenous, intraperitoneal, intramuscular, intracavity,
subcutaneous, or transdermal. When antibodies are used therapeutically, a preferred
route of administration is by pulmonary aerosol. Techniques for preparing aerosol
delivery systems containing antibodies are well known to those of skill in the art.
Generally, such systems should utilize components which will not significantly
impair the biological properties of the antibodies, such as the paratope binding
capacity (see, for example, Sciarra and Cutie, "Aerosols," in Remington's
Pharmaceutical Sciences, 18th edition, 1990, pp 1694-1712; incorporated by
reference). Those of skill in the art can readily determine the various parameters and
conditions for producing antibody aerosols without resort to undue experimentation.
When using antisense preparations of the invention, slow intravenous administration
is preferred.

The compositions of the invention are administered in effective amounts. An
"effective amount" is that amount of a composition that alone, or together with
further doses, produces the desired response. In the case of treating a particular
disease, such as cancer, the desired response is inhibiting the progression of the
disease. This may involve only slowing the progression of the disease temporarily,
although more preferably, it involves halting the progression of the disease
permanently. This can be monitored by routine methods or can be monitored
according to diagnostic methods of the invention discussed herein.

Such amounts will depend, of course, on the particular condition being treated, the
severity of the condition, the individual patient parameters including age, physical
condition, size and weight, the duration of the treatment, the nature of concurrent
therapy (if any), the specific route of administration and like factors within the
knowledge and expertise of the health practitioner. These factors are well known to
those of ordinary skill in the art and can be addressed with no more than routine
experimentation. It is generally preferred that a maximum dose of the individual
components or combinations thereof be used, that is, the highest safe dose according
to sound medical judgment. It will be understood by those of ordinary skill in the art,
however, that a patient may insist upon a lower dose or tolerable dose for medical
reasons, psychological reasons or for virtually any other reasons.

The pharmaceutical compositions used in the foregoing methods preferably are
sterile and contain an effective amount for producing the desired response in a unit of
weight or volume suitable for administration to a patient. The response can, for
example, be determined by measuring the physiological effects of the composition,
such as regression of a tumour, decrease of disease symptoms, modulation of
apoptosis, etc.

The doses of pharmaceutical agent administered to a subject can be chosen in
accordance with different parameters, in particular in accordance with the mode of
administration used and the state of the subject. Other factors include the desired
period of treatment. In the event that a response in a subject is insufficient at the
initial doses applied, higher doses (or effectively higher doses by a different, more
localized delivery route) may be employed to the extent that patient tolerance
permits.



In general, doses of pharmaceutical are formulated and administered in doses
between 1 ng and about 500 mg, and between 10 ng and 100 mg, according to any
standard procedure in the art. Where nucleic acids are employed, doses of between
1 ng and 0.1 mg generally will be formulated and administered according to standard
procedures. Other protocols for the administration of compositions will be known to
one of ordinary skill in the art, in which the dose amount, schedule of injections, sites
of injections, mode of administration (e.g., intra-tumoral) and the like vary from the
foregoing. Administration of pharmaceutical compositions to mammals other than
humans, e.g. for testing purposes or veterinary therapeutic purposes, is carried out
under substantially the same conditions as described above. A subject, as used
herein, is a mammal, preferably a human, and including a non-human primate, cow,
horse, pig, sheep, goat, dog, cat or rodent.

When administered, the pharmaceutical preparations of the invention are applied in
pharmaceutically-acceptable amounts and in pharmaceutically-acceptable
compositions. The term "pharmaceutically acceptable" means a non-toxic material
that does not interfere with the effectiveness of the biological activity of the active
ingredients. Such preparations may routinely contain salts, buffering agents,
preservatives, compatible carriers, and optionally other therapeutic agents. When
used in medicine, the salts should be pharmaceutically acceptable, but
non-pharmaceutically acceptable salts may conveniently be used to prepare
pharmaceutically-acceptable salts thereof and are not excluded from the scope of the
invention. Such pharmacologically and pharmaceutically-acceptable salts include,
but are not limited to, those prepared from the following acids: hydrochloric,
hydrobromic, sulfuric, nitric, phosphoric, maleic, acetic, salicylic, citric, formic,
malonic, succinic, and the like. Also, pharmaceutically-acceptable salts can be
prepared as alkaline metal or alkaline earth salts, such as sodium, potassium or
calcium salts.

Pharmaceutical compositions may be combined, if desired, with a
pharmaceutically-acceptable carrier. The term "pharmaceutically-acceptable carrier"
as used herein means one or more compatible solid or liquid fillers, diluents or
encapsulating substances which are suitable for administration into a human. The term
"carrier" denotes an organic or inorganic ingredient, natural or synthetic, with which
the active ingredient is combined to facilitate the application. The components of the
pharmaceutical compositions also are capable of being co-mingled with the
molecules of the present invention, and with each other, in a manner such that there
is no interaction which would substantially impair the desired pharmaceutical
efficacy.

The pharmaceutical compositions may contain suitable buffering agents, including: 
acetic acid in a salt; citric acid in a salt; boric acid in a salt; and phosphoric acid in a 
salt. 

The pharmaceutical compositions also may contain, optionally, suitable
preservatives, such as: benzalkonium chloride; chlorobutanol; parabens and
thimerosal.

The pharmaceutical compositions may conveniently be presented in unit dosage form
and may be prepared by any of the methods well-known in the art of pharmacy. All
methods include the step of bringing the active agent into association with a carrier
which constitutes one or more accessory ingredients. In general, the compositions
are prepared by uniformly and intimately bringing the active compound into
association with a liquid carrier, a finely divided solid carrier, or both, and then, if
necessary, shaping the product.

Compositions suitable for oral administration may be presented as discrete units,
such as capsules, tablets, lozenges, each containing a predetermined amount of the
active compound. Other compositions include suspensions in aqueous liquids or
non-aqueous liquids such as a syrup, elixir or an emulsion.

Compositions suitable for parenteral administration conveniently comprise a sterile
aqueous or non-aqueous preparation of pharmaceutical agents, which is preferably
isotonic with the blood of the recipient. This preparation may be formulated
according to known methods using suitable dispersing or wetting agents and
suspending agents. The sterile injectable preparation also may be a sterile injectable
solution or suspension in a non-toxic parenterally-acceptable diluent or solvent, for
example, as a solution in 1,3-butane diol. Among the acceptable vehicles and
solvents that may be employed are water, Ringer's solution, and isotonic sodium
chloride solution. In addition, sterile, fixed oils are conventionally employed as a
solvent or suspending medium. For this purpose any bland fixed oil may be
employed including synthetic mono- or di-glycerides. In addition, fatty acids such as
oleic acid may be used in the preparation of injectables. Carrier formulations suitable
for oral, subcutaneous, intravenous, intramuscular, etc. administrations can be found
in Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, PA.

An embodiment of the invention will now be described by example only and with
reference to the following Figures and Tables:

Figure 1 illustrates a concentration-response of cells growing in butyrate as sole
carbon source. This is the summary of four independent repeat experiments. Legend
shows butyrate concentrations in mM;

Figure 2 illustrates the purity and quality of RNA preparation. The 28S and 18S
sample bands are tight and clearly resolved for RNA prepared from butyrate- and
glucose-grown cells. Little or no DNA or salt contamination appears in the samples;

Table 1 illustrates nucleic acid and protein sequences identified by the screening
method according to the invention; and

Table 2 illustrates a summary of expression data of nucleic acid sequences identified
in Table 1.

Materials and Methods 

We have compared the expression profiles of colon cells growing in either glucose or
butyrate as a carbon source. HT29 colon carcinoma cells were cultured in DMEM
medium (Gibco) in the presence of 10% foetal calf serum, penicillin and
streptomycin. Cells were either cultured in glucose alone as the sole carbon source, or
in butyrate as the sole extraneously provided carbon source. Empirical analysis of
HT29 cells grown in multiple butyrate concentrations revealed that 2 mM butyrate
was optimal for cell culture in the absence of glucose. Cells were cultured in either
medium for multiple passages (typically 4). RNA was extracted from cells grown in
each condition and used to probe an Affymetrix human 12k array. The expression
profile of cells cultured in each condition was compared and genes altered in
expression by more than 2 fold are listed in Table 2.



Materials used during this study 



ITEM | ITEM SPECIFICS | SUPPLIER
Glucose medium (1) | Dulbecco's Modified Eagle Medium, 25 mM HEPES, 1 x, 0.1 micron filtered, with sodium pyruvate, with 1000 mg/l glucose, with pyridoxine + FCS + p/s (500 ml) | GIBCO
Butyrate medium (2), 0.2 mM NaB medium | Dulbecco's Modified Eagle Medium, 1 x, 0.1 micron filtered, with L-glutamine, without glucose, without sodium pyruvate + NaB (1M) 110 µl + FCS + p/s (555.1 ml) | GIBCO
Butyrate medium (3), 2 mM NaB medium | Dulbecco's Modified Eagle Medium, 1 x, 0.1 micron filtered, with L-glutamine, without glucose, without sodium pyruvate + NaB (1M) 1100 µl + FCS + p/s (556.1 ml) | GIBCO
Medium without glucose and without butyrate (4) | Dulbecco's Modified Eagle Medium, 1 x, 0.1 micron filtered, with L-glutamine, without glucose, without sodium pyruvate + FCS + p/s (550 ml) | GIBCO
NaB stock | Sodium Butyrate powder dissolved in sterile water, 250 mg in 2.27 ml water (1M), 0.2 µm filter sterilised | Sigma
Sterile syringes | 5 ml | Becton Dickinson UK, Ltd
Sterilising filters | 0.2 µm Acrodisc | Gelman Sciences, Ltd
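
The concentrations quoted for the NaB stock and the two butyrate media in the materials list above can be checked with a few lines of arithmetic. The following Python sketch is illustrative only; the molar mass of sodium butyrate (about 110.09 g/mol) is an assumption supplied here and is not stated in the source, and the function names are hypothetical.

```python
# Arithmetic check of the sodium butyrate (NaB) stock and working concentrations.
# Assumption (not in the source): molar mass of sodium butyrate ~110.09 g/mol.

SODIUM_BUTYRATE_G_PER_MOL = 110.09


def stock_molarity(mass_mg: float, volume_ml: float,
                   molar_mass: float = SODIUM_BUTYRATE_G_PER_MOL) -> float:
    """Molarity (mol/l) of a stock made from mass_mg of solute in volume_ml."""
    moles = (mass_mg / 1000.0) / molar_mass
    return moles / (volume_ml / 1000.0)


def final_concentration_mm(stock_molar: float, stock_ul: float,
                           total_ml: float) -> float:
    """Final concentration (mM) after adding stock_ul of stock to total_ml."""
    millimoles = stock_molar * (stock_ul / 1000.0)  # mol/l x ml = mmol
    return millimoles / (total_ml / 1000.0)


if __name__ == "__main__":
    print(f"Stock: {stock_molarity(250, 2.27):.3f} M")
    print(f"Medium (2): {final_concentration_mm(1.0, 110, 555.1):.3f} mM")
    print(f"Medium (3): {final_concentration_mm(1.0, 1100, 556.1):.3f} mM")
```

Under that assumption, 250 mg in 2.27 ml gives a stock of approximately 1 M, and 110 µl or 1100 µl of stock in the stated final volumes gives approximately 0.2 mM and 2 mM respectively.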


Item | Item specifics | Supplier
FCS | Foetal Calf Serum, 50 ml per 500 ml DMEM | Harlan Sera Lab
P/S | Penicillin-Streptomycin solution, 100 ml bottle (100 X), 5 ml per 500 ml DMEM | Sigma
TE for splitting cells | Trypsin Enzyme, 100 ml bottle, 3 ml per T75 and 1 ml per 6 well plate well | Sigma
FCS tubes | 50 ml Centrifuge tubes | Corning Inc
P/S + TE tubes | 30 ml Universal containers | Bibby Sterilin Ltd
Tissue Culture Plates | 6 well, sterile, with lid, single packed | Greiner bio-one
Tissue Culture Flasks | T75 | Nunclon
Stripette ® 5 ml, 10 ml, 25 ml | Serological Pipette, individually wrapped | Corning Inc / Costar
Pipette | Powerpette plus | Jencons
Cell counting slide | Haemocytometer, improved Neubauer | Neubauer
Ethanol for tissue culture | 70 % EtOH | Sigma
Virkon for cell culture | 1 % Virkon | Day Impex, Ltd
Microscope for cell work | Light 6 - 10X | CK Olympus, Tokyo
Paper towels | Blue | Jamont (UK), Ltd
Latex-free examination gloves | Large | Shermond Surgical Supply, Ltd








Item | Item specifics | Supplier
RNA extraction reagent | TRIzol® Reagent | Invitrogen - Life Technologies
RNA extraction reagent | Chloroform | Sigma
RNA extraction reagent | Isopropyl alcohol | Sigma
RNA extraction reagent | 75% EtOH in DEPC-treated water | Sigma
RNA extraction reagent | RNase-free water | Sigma
RNA clean up kit | RNeasy Midi Kit (10 RNeasy midi spin columns) | Qiagen
β-Mercaptoethanol | 14.3 M stock solution | Sigma
Ethanol for Qiagen | 96-100% EtOH | Sigma
Agarose | 1 g in 100 ml TB-EDTA buffer | Helena Biosciences, UK
TB-EDTA buffer | Tris-Borate-EDTA buffer, 100 ml | Sigma
Eppendorf tubes | 1.5 ml | Sarstedt Laboratory Supplies, Ltd
Loading buffer | 6X | Promega



The Human Colon Carcinoma Cell Line - HT29 

The HT29 cell line is established from a colon adenocarcinoma which was removed
from a 44 year old Caucasian woman. The cell line is epithelial in origin and
hypertriploid. It has been shown to be tumourigenic in nude mice and synthesizes
carcinoembryonic antigen, CEA (Egan & Todd, 1972), and the transforming
growth factors TGF-α and TGF-β (Anzano et al. 1989) when maintained in vitro.
The HT29 cell line constitutively over-produces mutant p53 protein as a consequence
of a point mutation at codon 273, resulting in an Arginine to Histidine amino acid
substitution (Hsu et al. 1994).

The Culture of HT29 Colorectal Adenocarcinoma Cells

Cells were cultured in T75 tissue culture flasks (Nunclon) in 5% CO2 at 37°C. Cells
were passaged when confluent by washing twice in PBS and incubating in
pre-warmed trypsin : EDTA (1:1) at 37°C until cells detached. The cells were then
re-suspended in the appropriate growth medium, either glucose DMEM or butyrate
DMEM, before being seeded into new T75 tissue culture flasks or 6-well plates.

Optimisation of HT29 cell growth in butyrate as sole extraneous carbon source

HT29 cells were seeded out into 19 wells (in 6 well plates) at a cell density of 0.5 x
10^6 cells per well (i.e. 500 000 cells per well) deduced with the aid of a
Haemocytometer (Improved Neubauer). These cells were taken from T75 0.2 mM
butyrate (NaB) DMEM flasks and allowed to adhere to the 6-well plates over 72 hrs,
also in 0.2 mM NaB DMEM with FCS and Penicillin / Streptomycin antibiotics.
After the cells had adhered to the surface of the 6 well plates the 0.2 mM NaB
DMEM was removed and each well was washed twice with PBS in order to remove
all traces of the 0.2 mM NaB DMEM. Different concentrations of NaB DMEM with
FCS and with Penicillin / Streptomycin antibiotics were then added to the appropriate
wells in triplicate. Cell counts were taken at various time points. Media were
changed daily in order to maintain the desired NaB concentration in each
well. All solutions and reagents used were pre-warmed in a water bath prior to use so as
to avoid any cold shock to the cells.
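
The seeding density of 0.5 x 10^6 cells per well is derived from haemocytometer counts. As a minimal sketch, and assuming the usual convention for an improved Neubauer chamber that each large 1 mm x 1 mm square under the coverslip holds 0.1 µl (so that cells per ml equals the mean count per large square multiplied by the dilution factor and by 10^4), the suspension volume needed per well could be worked out as follows. The counts shown are hypothetical and the function names are illustrative.

```python
# Sketch: convert haemocytometer counts to cells/ml and a seeding volume.
# Assumes each large square of an improved Neubauer chamber holds 0.1 ul,
# i.e. cells/ml = mean count per large square x dilution factor x 1e4.
# The counts below are hypothetical, not data from this study.

SQUARES_TO_ML = 1e4  # one large square = 1e-4 ml


def cells_per_ml(counts_per_square: list[float], dilution_factor: float = 1.0) -> float:
    """Estimate cell concentration from counts of several large squares."""
    mean_count = sum(counts_per_square) / len(counts_per_square)
    return mean_count * dilution_factor * SQUARES_TO_ML


def seeding_volume_ml(target_cells: float, concentration: float) -> float:
    """Volume of suspension (ml) that contains target_cells."""
    return target_cells / concentration


if __name__ == "__main__":
    concentration = cells_per_ml([52, 48, 50, 50])
    volume = seeding_volume_ml(0.5e6, concentration)
    print(f"{concentration:.0f} cells/ml, seed {volume:.2f} ml per well")
```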
RNA extraction using TRIzol® Reagent



Total RNA was extracted from HT29 cells grown to confluence in T75 flasks using
TRIzol Reagent as per the manufacturer's recommendations. Cells were grown for
several passages either in butyrate-containing medium, or in glucose-containing
medium prior to extraction of RNA.

Cells were homogenised using 1 ml TRIzol Reagent per 10 cm2 area of culture
surface. The homogenised samples were incubated for 5 minutes at ambient
temperature to permit the complete dissociation of nucleoprotein complexes. 200 µl
of chloroform was added to each sample. Tubes were shaken vigorously by hand for
15 seconds and incubated at ambient temperature for 3 minutes. Samples were
centrifuged at 12000 g for 15 minutes at 4°C. RNA in the aqueous phase was
separated and precipitated using isopropyl alcohol. RNA was rinsed, air dried and
redissolved in RNase-free water.

RNA was further purified using Qiagen RNeasy columns. The columns were used
exactly as per the manufacturer's recommendations. RNA was eluted into RNase-free
water.

RNA purified in this way was analysed by agarose gel to establish purity and quality.
The gel is shown in Figure 2.

Microarray analysis

Microarray analysis was undertaken as a commercial service by the University of
Newcastle-upon-Tyne. In this study, the 2 RNA samples (1x butyrate + 1x glucose)
from the 2 experimental conditions (butyrate + glucose) were sent to the Institute for
Human Genetics at the University of Newcastle-upon-Tyne for microarray analysis.
This was performed on a 12 k Affymetrix Homo sapiens gene chip. Genes altered in
expression by more than 2 fold on the microarray are listed in Table 1.
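
The "more than 2 fold" criterion can be illustrated with a short Python sketch. This is not the analysis pipeline used by the microarray service, which is not described in the source; it simply assumes one already-normalised signal value per probe for each condition. The probe identifiers and signal values are hypothetical.

```python
# Sketch of a >2-fold filter between two conditions (butyrate vs glucose).
# Assumes one normalised, positive signal value per probe per condition.
# Probe names and values are hypothetical, not data from Table 1 or Table 2.


def altered_probes(butyrate: dict[str, float], glucose: dict[str, float],
                   threshold: float = 2.0) -> dict[str, float]:
    """Return butyrate/glucose ratios for probes changed by more than threshold-fold."""
    hits = {}
    for probe in sorted(butyrate.keys() & glucose.keys()):
        b, g = butyrate[probe], glucose[probe]
        if b <= 0 or g <= 0:
            continue  # a ratio is undefined without positive signal in both samples
        ratio = b / g
        if ratio > threshold or ratio < 1.0 / threshold:
            hits[probe] = ratio
    return hits


if __name__ == "__main__":
    butyrate_signal = {"probe_A": 400.0, "probe_B": 100.0, "probe_C": 90.0}
    glucose_signal = {"probe_A": 100.0, "probe_B": 100.0, "probe_C": 300.0}
    for probe, ratio in altered_probes(butyrate_signal, glucose_signal).items():
        direction = "up" if ratio > 1 else "down"
        print(f"{probe}: {ratio:.2f}-fold ({direction} in butyrate)")
```

A ratio above the threshold marks a probe as increased in butyrate-grown cells, and a ratio below its reciprocal marks it as decreased; a probe changed by exactly 2 fold is not reported.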
Claims



1. A method to screen for nucleic acid molecules which show altered expression
in an isolated first cell sample comprising comparing the gene expression profiles
between said first cell sample with a second reference cell sample wherein said first
cell sample has been grown in the presence of the carbon source butyrate, or a related
carbon source from which butyrate is derived, either directly or indirectly, and
comparing said expression profile with the expression profile in said second
reference cell sample which has not been grown in the presence of butyrate, or said
related carbon source.

2. A method according to Claim 1 wherein said screen for nucleic acid
molecules comprises the steps of:

i) providing

a) a cell growth preparation comprising a first cell sample derived
from at least one region of the colon; cell growth media; and a
carbon source wherein said carbon source is butyrate; and

b) a cell growth preparation comprising a second cell sample derived
from an equivalent region of the colon; cell growth media; and a
carbon source which is not butyrate;

ii) extracting nucleic acid from said first and second cell samples; and

iii) comparing the gene expression profile in said first cell sample with
the gene expression profile in said second cell sample.

3. A method according to Claim 1 or 2 wherein said first and second cell
samples are derived from the ascending colon.

4. A method according to Claim 1 or 2 wherein said first and second cell
samples are derived from the transverse colon.

5. A method according to Claim 1 or 2 wherein said first and second
samples are derived from the descending colon.

6. A method according to Claim 1 or 2 wherein said first and second
samples are derived from the sigmoid region of the colon.

7. A method according to Claim 6 wherein said cell samples are derived
from the rectal region of the colon.

8. A method according to any of Claims 1-7 wherein said first and 
second cell samples comprise epithelial cells. 

9. A method according to any of Claims 1-8 wherein said carbon source 
which is not butyrate is glucose. 

10. A method according to any of Claims 1-9 wherein said nucleic acid 
molecule which shows altered expression is selected from the group as 
represented by the nucleic acid sequences as shown in Table 1, or nucleic 
acid molecules which hybridise to the sequences presented in Table 1. 

11. A method for the detection of at least one nucleic acid molecule
associated with the initiation and/or progression of colorectal cancer, in an
animal, comprising the steps of:

i) providing a biological sample comprising at least one cell to
be tested;

ii) contacting said sample with a ligand which binds at least one
nucleic acid molecule as represented by the nucleic acid
sequence selected from the group consisting of:

a) a nucleic acid molecule as represented by the nucleic acid
sequence as shown in Table 1;

b) a nucleic acid molecule which hybridises to nucleic acid
molecules as defined in (a);

c) a nucleic acid molecule that is degenerate because of the genetic
code to the nucleic acid molecule represented in (a) and (b); and

iii) detecting the presence of at least one nucleic acid molecule in
said sample.

12. A method according to Claim 11 wherein said colorectal cancer is
adenocarcinoma.

13. A method according to Claim 11 or 12 wherein said ligand is a nucleic acid
molecule adapted to anneal to said nucleic acid molecule which is associated with
colorectal cancer.

14. A method according to Claim 13 wherein said method is a polymerase chain
reaction method.

15. A method for the detection of at least one polypeptide associated with the
initiation and/or progression of colorectal cancer, in an animal, comprising the steps
of:

i) providing a biological sample comprising at least one cell to be tested;

ii) contacting said sample with at least one ligand which ligand
specifically binds at least one polypeptide encoded by a nucleic acid
molecule as represented by the nucleic acid sequence as shown in
Table 1, or a variant polypeptide comprising an amino acid sequence
which varies by the addition, deletion or substitution of at least one
amino acid residue of the amino acid sequence shown in Table 1; and

iii) detecting the presence of at least one polypeptide in said sample.

16. A method according to any of Claims 11-15 wherein said animal is human.

17. A method according to Claim 15 or 16 wherein said ligand is an antibody.

18. A method according to Claim 17 wherein said antibody is a monoclonal
antibody, or at least the effective binding part thereof.

19. The use of at least one polypeptide, or variant sequence thereof, encoded by a
nucleic acid molecule(s) as represented by the nucleic acid sequence as shown in
Table 1, as a target for the screening of agents which modulate the activity of said
polypeptide.

20. A method to screen for agents which modulate the activity of at least one
polypeptide encoded by a gene associated with the initiation and/or progression of
colorectal cancer comprising the steps of:

i) forming a preparation comprising at least one polypeptide wherein
said polypeptide is encoded by a nucleic acid sequence as shown in
Table 1, or a variant polypeptide comprising an amino acid sequence
which varies by the addition, deletion or substitution of at least one
amino acid residue of the amino acid sequence shown in Table 1 and
at least one agent to be tested; and

ii) determining the activity of said agent with respect to activity of said
polypeptide.

21. A method according to Claim 20 wherein said polypeptide is expressed by a
cell wherein said cell is transformed or transfected with said nucleic acid molecule.

22. A method according to Claim 21 wherein said nucleic acid molecule is part of
a vector adapted for recombinant expression of said nucleic acid molecule.

23. A method according to Claim 22 wherein said vector is provided with a
promoter which enables the expression of said nucleic acid molecule to be regulated.

24. A method according to any of Claims 21-23 wherein said cell is derived from
the colon.

25. A method according to Claim 24 wherein said cell is an epithelial cell.

26. A method according to any of Claims 20-25 wherein said agent is an
antibody.

27. A method according to Claim 26 wherein said antibody is a monoclonal
antibody or modified monoclonal antibody, or at least the effective binding part
thereof.

28. A method according to Claim 27 wherein said binding part is a Fab fragment.

29. A method according to Claim 28 wherein said antibody is selected from the
group consisting of: F(ab')2, Fab, Fv and Fd fragments; antibodies comprising CDR3
regions, and single chain antibody variable regions.

30. A method according to Claim 26 wherein said antibody is a humanised
antibody.

31. A method according to Claim 26 wherein said antibody is a chimeric
antibody.

32. A method according to any of Claims 20-25 wherein said agent is a
polypeptide.

33. A method according to any of Claims 20-25 wherein said agent is a peptide.

34. A method according to any of Claims 20-25 wherein said agent is a nucleic acid
molecule.

35. A method according to Claim 34 wherein said nucleic acid molecule is an
aptamer.

36. A method according to Claim 34 wherein said nucleic acid is an inhibitory
RNA molecule.



37. A method according to Claim 36 wherein said inhibitory RNA is encoded by 
a transcription cassette comprising a nucleic acid molecule, or part thereof, selected 
from the group consisting of: 
i) a nucleic acid molecule as represented by the nucleic acid sequence 
as shown in Table 1; 
ii) a nucleic acid molecule which hybridises to the sequence in (i); or 
iii) a nucleic acid molecule which is degenerate because of the genetic 
code to the sequences defined in (i) and (ii) above; 
wherein said cassette is adapted such that both sense and antisense nucleic acid 
molecules are transcribed from said cassette. 



38. A method according to Claim 37 wherein said cassette is provided with at 
least two promoters adapted to transcribe both sense and antisense strands of said 
nucleic acid molecule. 

39. A method according to Claim 37 wherein said cassette comprises a nucleic 
acid molecule wherein said molecule comprises a first part linked to a second part 
wherein said first and second parts are complementary over at least part of their 
sequence and further wherein transcription of said nucleic acid molecule produces an 
RNA molecule which forms a double stranded region by complementary base pairing 
of said first and second parts. 

40. A method according to Claim 34 wherein said nucleic acid molecule is an 
antisense nucleic acid molecule. 

41. An antibody, or effective binding part thereof, identified by the method 
according to any of Claims 26-31 for use as a pharmaceutical. 

42. A polypeptide identified by the method according to Claim 32 for use as a 
pharmaceutical. 

43. A peptide identified by the method according to Claim 33 for use as a 
pharmaceutical. 

44. A nucleic acid molecule identified by the method according to Claim 34 for use 
as a pharmaceutical. 

45. Use according to Claim 44 wherein said nucleic acid molecule is an aptamer. 

46. Use according to Claim 44 wherein said nucleic acid molecule is an inhibitory 
RNA. 

47. Use according to Claim 44 wherein said nucleic acid molecule is an antisense 
nucleic acid molecule. 

48. Use according to any of Claims 41-47 wherein said pharmaceutical further 
comprises a diluent, carrier or excipient. 



Abstract 



We describe a method for the identification of genes which show regulated 
expression in response to carbon source utilisation, typically genes associated with 
the initiation and/or promotion of cell transformation from a non-cancerous to a 
cancerous phenotype, typically of cells found in the colon; the use of these genes in 
diagnostic assays and as targets for the development of chemotherapeutic drugs and 
agents identified by said assay. 

TABLE 1 



AC J02966; 

DE Human mitochondrial ADP/ADT translocator mRNA, complete cds. 

KW ADP/ADT translocator. 

/translation="MGDHAWSFLKDFLAGAVAAAVSKTAVAPIERVKLLLQVQHASKQI 
SAEKQYKGIIDCVVRIPKEQGFLSFWRGNLANVIRYFPTQALNFAFKDKYKQLFLGGVD 
RHKQFWRYFAGNLASGGAAGATSLCFVYPLDFARTRLAADVGRRAQREFHGLGDCIIKI 
FKSDGLRGLYQGFNVSVQGIIIYRAAYFGVYDTAKGMLPDPKNVHIFVSWMIAQSVTAV 
AGLLSYPFDTVRRRMMMQSGRKGADIMYTGTVDCWRK^ 
GGAFVLVLYDEIKKYV" 
Sequence 1320 BP; 341 A; 304 C; 357 G; 318 T; 0 other; 



[...] gtgggcgaga 60 
[...] atcacgcttg 120 
[...] agaccgcggt 180 
[...] aacagatcag 240 
[...] aggagcaggg 300 
[...] ccacccaagc 360 
[...] tggatcggca 420 
[...] ctggggccac 480 
[...] ctgatgtggg 540 
[...] tcttcaagtc 600 
[...] tcattatcta 660 
[...] accccaagaa 720 
[...] cagggctgct 780 
[...] ggaaaggggc 840 
[...] acgaaggagc 900 
[...] gtgcttttgt 960 
[...] caagttcaca 1020 
[...] tagactattc 1080 
[...] cagaagagat 1140 
[...] atttaaattc 1200 
[...] tgataataac 1260 
[...] aaaaaaaaaa 1320 



HSA132099 standard; mRNA; HUM; 3109 BP. 
Homo sapiens mRNA for VNN1 protein 

vanin-like gene; vnn1 gene; VNN1 protein. 

/protein_id="CAA10568.1" 

/translation="MTTQLPAYVAILLFYVSRASCQDTFIAAVYEHAAILPNATLTPVS 
REEALALMNRNLDILEGAITSAADQGAHIIVTPEDAIYGWNFNRDSLYPYLEDIPDPEV 
NWIPCNNRNRFGQTPVQERLSCLAKNNSIYVVANIGDKKPCDTSDPQCPPDGRYQYNTD 
VVFDSQGKLVARYHKQNLFMGENQFNVPKEPEIVTFNTTFGSFGIFTCFDILFHDPAVT 
LVKDFHVDTIVFPTAWMNVLPHLSAVEFHSAWAMGMRVNFLASNIHYPSKKMTGSGIYA 
PNSSRAFHYDMKTEEGKLLLSQLDSHPSHSAVVNWTSYASSIEALSSGNKEFKGTVFFD 
EFTFVKLTGVAGNYTVCQKDLCCHLSYKMSENIPNEVYALGAFDGLHTVEGRYYLQICT 
LLKCKTTNLNTCGDSAETASTRFEMFSLSGTFGTQYVFPEVLLSENQLAPGEFQVSTDG 
RLFSLKPTSGPVLTVTLFGRLYEKDWASNASSGLTAQARIIMLIVIAPIVCSLSW" 

Sequence 3109 BP; 973 A; 630 C; 601 G; 905 T; 0 other; 

cattggactt cagcatgact actcagttgc cagcttacgt ggcaattttg cttttctatg 60 

tctcaagagc cagctgccag gacactttca ttgcagctgt ttatgagcat gcagcgatat 120 

tgcccaatgc caccctaaca ccagtgtctc gtgaggaggc tttggcatta atgaatcgga 180 

atctggacat tttggaagga gcgatcacat cagcagcaga tcagggtgcg catattattg 240 

tgactccaga agatgctatt tatggctgga acttcaacag ggactctctc tacccatatt 300 

tggaggacat cccagaccct gaagtaaact ggatcccctg taataatcgt aacagatttg 360 

gccagacccc agtacaagaa agactcagct gcctggccaa gaacaactct atctatgttg 420 

tggcaaatat tggggacaag aagccatgcg ataccagtga tcctcagtgt ccccctgatg 480 

gccgttacca atacaacact gatgtggtat ttgattctca aggaaaactg gtggcacgct 540 

accataagca aaaccttttc atgggtgaaa atcaattcaa tgtacccaag gagcctgaga 600 

ttgtgacttt caataccacc tttggaagtt ttggcatttt cacatgcttt gatatactct 660 

tccatgatcc tgctgttacc ttggtgaaag atttccacgt ggacaccata gtattcccaa 720 

cagcttggat gaatgttttg ccacatttgt cagctgttga attccactca gcttgggcta 780 

tgggcatgag ggtcaatttc cttgcatcca acatacatta cccctcaaag aaaatgacag 840 

gaagtggcat ctatgcaccc aattcttcaa gagcatttca ttatgatatg aagacagaag 900 

agggaaaact cctcctctcg caactggatt cccacccatc ccattctgca gtggtgaact 960 

ggacttccta tgccagcagt atagaagcgc tctcatcagg aaacaaggaa tttaaaggca 1020 

ctgtcttttt cgatgaattc acttttgtga agctcacagg agttgcagga aattatacag 1080 

tttgtcagaa agatctctgc tgtcatttaa gctacaaaat gtctgagaac ataccaaatg 1140 

aagtgtacgc tctaggggca tttgacggac tgcacactgt ggaagggcgc tattatctac 1200 

agatttgtac cctgttgaaa tgtaaaacga ctaatttaaa cacttgcggt gactcagctg 1260 

aaacagcttc taccaggttt gaaatgttct ccctcagtgg cactttcgga acccagtatg 1320 

tctttcctga ggtgttgctg agtgaaaatc agcttgcacc tggagaattt caggtgtcaa 1380 

ctgacggacg cttgtttagt ctgaagccaa catccggacc tgtcttaaca gtaactctgt 1440 

ttgggaggtt gtatgagaag gactgggcat caaatgcttc atcaggcctc acagcacaag 1500 

caagaataat aatgctaata gttatagcac ctattgtatg ctcattaagt tggtagaata 1560 

ttgacttttt ctctttttta tttgggataa tttaaaaaat gatggatgag aaaagaaaga 1620 

ttggtccggg ttaatattat cctctagtat aagtgaatta ctagtttctc tttatttaga 1680 

caaacacaca cacaccagat aatataaact taataaatta tctgttaatg tagattttat 1740 

ttaaaaaact atatttgaac attggtcttt cttggacgtg agctaattat atcaaataag 1800 

tatcacaaat cttttacgca gaagaaataa aaactacggg tagaaaacat aagaactatc 1860 

ataaaattta cttacaagga ggctgctctt gttaccactt ttattatatt acgtatcact 1920 

tattcagctc tgctgaaaat ttccaatgac tttgtttgtt tgctctttta gttttttacc 1980 

taaacaatac attttgattc tcttgtgggt tgataatgtc tccccaaaat ttacatgttg 2040 

aagcacctca gaatgtgact gtatttggag acagggtctt taaagaggta aaataaggtc 2100 

attaggatag accctaattc aatatgactg atgatcataa aagaagaggc gagtagggca 2160 

caacaggcac aaagggagac cataaggaga cacagaggaa ggacaactct ttacaagcta 2220 

agaagagagg gcctcagaag aaaccaaccc tgccaacacc ttgatcttgg acttccagcc 2280 

tccaaaacta tgagaaataa atttctattg tttaagtcac ccagtccatg gtactttgtt 2340 

aggcagccct ggcaaatgaa tcaaagaccc attcctgttc ctctccccac cactactgtt 2400 

ttctactgta atctgaagct tcaacaaaag gcttacctgg taagaatatt cagctggtct 2460 



gggtcctcaa gactccaata gacactctta aagaaggatt gctgatggat tgatagtgaa 2520 

accattagat cattgaattc ctctggaatt agaaaaccag agagtcccat tttaagaaat 2580 

tagatattta atatagcatt gtgtgttcta ttttagtaac agcagaatct cttgacatta 2640 

cacaactcag tgaaacaaca tcatttaagc caaaatatct cccaactgac tgatagactc 2700 

tgagcactaa tatcatagtg ctgtgatgat ggacaattac atagtaccga taacagccat 2760 

gcactgtgca aagcatgccc ttctgcacag gagagcaagg cacttgcagt agtgatctat 2820 

gccagcaaaa catcattttg agacaaacat ttttgtggca gatgtttttc ctaaaaagta 2880 

ctatatcatc caagaaatat ttgagtaaaa tcccttgttc ttttgggtga cattaactga 2940 

catttgcttt ttttcaagac ctaatagaaa ataagaaagc ccataatgta tttagaaaca 3000 

ggaatcctca gagcaattct ctgtattctc atataatttc aatgtaaaac agaaaacata 3060 

ttgatgtgtt ggtgataggc ttgaattatt aaaaacttca aaaacaaaa 3109 




Homo sapiens transmembrane protein 5, mRNA (cDNA clone MGC:17085 
IMAGE:3919181), complete cds. 

/protein_id="AAH13152.1" 

/translation="MRLTRKRLCSFLIALYCLFSLYAAYHVFFGRRRQAPAGSPRGLRK 
GAAPARERRGREQSTLESEEWNPWEGDEKNEQQHRFKTSLQILDKSTKGKTDLSVQIWG 
KAAIGLYLWEHIFEGLLDPSDVTAQWREGKSIVGRTQYSFITGPAVIPGYFSVDVNNVV 
LILNGREKAKIFYATQWLLYAQNLVQIQKLQHLAVVLLGNEHCDNEWINPFLKRNGGFV 
ELLFIIYDSPWINDVDVFQWPLGVATYRNFPVVEASWSMLHDERPYLCNFLGTIYENSS 
RQALMNILKKDGNDKLCWVSAREHWQPQETNESLKNYQDALLQSDLTLCPVGVNTECYR 
IYEACSYGSIPVVEDVMTAGNCGNTSVHHGAPLQLLKSMGAPFIFIKNWKELPAVLEKE 
KTIILQEKIERRKMLLQWYQHFKTELKMKFTNILESSFLMNNKS" 



Sequence 1469 BP; 446 A; 300 C; 349 G; 374 T; 0 other; 

ggctgggcct gcctcggacg ccgccggtgt cgcggattct ctttccgccc gctccatggc 60 

ggtggatgcc tgactggaag cccgagtggg atgcggctga cgcggaagcg gctctgctcg 120 

tttcttatcg ccctgtactg cctattctcc ctctacgctg cctaccacgt cttcttcggg 180 

cgccgccgcc aggcgccggc cgggtccccg cggggcctca ggaagggggc ggcccccgcg 240 

cgggagagac gcggccgaga acagtccact ttggaaagtg aagaatggaa tccttgggaa 300 

ggagatgaaa aaaatgagca acaacacaga tttaaaacta gccttcaaat attagataaa 360 

tccacgaaag gaaaaacaga tctcagtgta caaatctggg gcaaagctgc cattggcttg 420 

tatctctggg agcatatttt tgaaggctta cttgatccca gcgatgtgac tgctcaatgg 480 

agagaaggaa agtcaatcgt aggaagaaca cagtacagct tcatcactgg tccagctgta 540 

ataccagggt acttctccgt tgatgtgaat aatgtggtac tcattttaaa tggaagagaa 600 

aaagcaaaga tcttttatgc cacccagtgg ttactttatg cacaaaattt agtgcaaatt 660 

caaaaactcc agcatcttgc tgttgttttg ctcggaaatg aacattgtga taatgagtgg 720 

ataaacccat tcctcaaaag aaatggaggc ttcgtggagc tgcttttcat aatatatgac 780 

agcccctgga ttaatgacgt ggatgttttt cagtggcctt taggagtagc aacatacagg 840 

aattttcctg tggtggaggc aagttggtca atgctgcatg atgagaggcc atatttatgt 900 

aatttcttag gaacgattta tgaaaattca tccagacagg cactaatgaa cattttgaaa 960 

aaagatggga acgataagct ttgttgggtt tcagcaagag aacactggca gcctcaggaa 1020 

acaaatgaaa gtcttaagaa ttaccaagat gccttgcttc agagtgatct cacattgtgc 1080 

ccggtcggag taaacacaga atgctatcga atctatgagg cttgctccta tggctccatt 1140 

cctgtggtgg aagacgtgat gacagctggc aactgtggga atacatctgt gcaccacggt 1200 

gctcctctgc agttactcaa gtccatgggt gctcccttta tctttatcaa gaactggaag 1260 

gaactccctg ctgttttaga aaaagagaaa actataattt tacaagaaaa aattgaaaga 1320 

agaaaaatgt tacttcagtg gtatcagcac ttcaagacag agcttaaaat gaaatttact 1380 

aatattttag aaagctcatt tttaatgaat aataaaagtt aattatcttt ttgagctaaa 1440 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 

Homo sapiens CD3e-associated protein (CAST) mRNA, complete cds. 
/protein_id="AAD41158.1" 

/translation="MEEPQAGGEDAARFSCPPNFTAKPPASESPRFSLEALTGPDTELW 
LIQAPADFAPECFNGRHVPLSGSQIVKGKLAGKRHRYRVLSSCPQAGEATLLAPSTEAG 

GGLTCASAPQGTLRILEGPQQSLSGSPLQPIPASPPPQIPPGLRPRFCAFGGNPPVTGP 
RSALAPNLLTSGKKKKEMQVTEAPVTQEAVNGHGALEVDMALGSPEMDVRKKKKKKNQQ 
LKEPEAAGPVGTEPTVETLEPLGVl.FPSTTKKRKKPKGKETFEPEDKTVKQEQIWT^ 
EDTVLSPTKKRKRQKGTEGMEPEEGVTVESQPQVKVEPLEEAIPIiPPT 
MMEPGTEAMEPVEPEMKPLESPGGTMAPQQPEGAKPQA 




Sequence 1841 BP; 512 A; 502 C; 576 G; 251 T; 0 other; 
cccaggatgg aggagcccca ggccggcggt gaggatgctg 
aactttaccg cgaagccccc agcctcagag tcccctcgtt 
ccccaagagg agatgccagg gccgccactg aattcagagt 
agtcattaaa ggagctgttt cctgggtaaa 

[...] 60 
[...] ggcgcxgacg 120 
[...] agaatgcttc 180 
[...] 240 
[...] 300 
[...] ccccccaggg caccctaagg 360 
[...] tgcagcccat cccagcaagt 420 
[...] gtgcctttgg gggcaaccca 480 
[...] tcacctcagg gaagaagaaa 540 
[...] cagtgaatgg gcacggggcc 600 
[...] tgcggaagaa gaagaagaaa 660 
[...] tggggacaga gcccacagtg 720 
[...] ccaagaagag gaagaagccc 780 
[...] agcaggaaca gattaacact 840 
[...] gaaagaggca aaaggggacg 900 
[...] agccacaggt gaaggtggag 960 
[...] ggaaaaaaga aaagggacag 1020 
[...] tggagccgga gatgaagcct 1080 
[...] aaggagcgaa gcctcaggcc 1140 
[...] aaaaacagca agatgccaca 1200 
[...] atgaccttga gcctcaggca 1260 
[...] gaggtcacac agtgactgag 1320 
[...] agcctgaagc cagggcaact 1380 
[...] gccggatgcc agagacagtg 1440 
[...] ctggggagga ggctcccaca 1500 
[...] tgtagtctgc ccccgggaaa 1560 
[...] ccagaaggtg acacccccag 1620 
[...] cctggcctgg gaggacgatt 1680 
[...] agggtacttt caagaagggc 1740 
[...] ggagaaaaaa atgaggaacc 1800 





DE Homo sapiens Apo-2 ligand mRNA, complete cds. 

/translation="MAMMEVQGGPSLGQTCVLIVIFTVLLQSLCVAVTYVYFTNELKQM 
QDKYSKSGIACFLKEDDSYWDPNDEESMNSPCWQVKWQLRQLVRKMILRTSEETISTVQ 
EKQQNISPLVRERGPQRVAAHITGTRGRSNTLSSPNSKNEKALGRKINSWESSRSGHSF 
LSNLHLRNGELVIHEKGFYYIYSQTYFRFQEEIKENTKNDKQMVQYIYKYTSYPDPILL 
MKSARNSCWSKDAEYGLYSIYQGGIFELKENDRIFVSVTNEHLIDMDHEASFFGAFLVG" 

3'UTR 937..1042 

Sequence 1042 BP; 348 A; 208 C; 232 G; 254 T; 0 other; 
tttcctcact gactataaaa gaatagagaa ggaagggctt cagtgaccgg ctgcctggct 60 
gacttacagc agtcagactc tgacaggatc atggctatga tggaggtcca ggggggaccc 120 
agcctgggac agacctgcgt gctgatcgtg atcttcacag tgctcctgca gtctctctgt 180 
gtggctgtaa cttacgtgta ctttaccaac gagctgaagc agatgcagga caagtactcc 240 
aaaagtggca ttgcttgttt cttaaaagaa gatgacagtt attgggaccc caatgacgaa 300 
gagagtatga acagcccctg ctggcaagtc aagtggcaac tccgtcagct cgttagaaag 360 
atgattttga gaacctctga ggaaaccatt tctacagttc aagaaaagca acaaaatatt 420 
tctcccctag tgagagaaag aggtcctcag agagtagcag ctcacataac tgggaccaga 480 
ggaagaagca acacattgtc ttctccaaac tccaagaatg aaaaggctct gggccgcaaa 540 
ataaactcct gggaatcatc aaggagtggg cattcattcc tgagcaactt gcacttgagg 600 
aatggtgaac tggtcatcca tgaaaaaggg ttttactaca tctattccca aacatacttt 660 
cgatttcagg aggaaataaa agaaaacaca aagaacgaca aacaaatggt ccaatatatt 720 
tacaaataca caagttatcc tgaccctata ttgttgatga aaagtgctag aaatagttgt 780 
tggtctaaag atgcagaata tggactctat tccatctatc aagggggaat atttgagctt 840 
aaggaaaatg acagaatttt tgtttctgta acaaatgagc acttgataga catggaccat 900 
gaagccagtt ttttcggggc ctttttagtt ggctaactga cctggaaaga aaaagcaata 960 
acctcaaagt gactattcag ttttcaggat gatacactat gaagatgttt caaaaaatct 1020 
raccaaaaca aacaaacaga aa 



DE Homo sapiens mRNA for annexin A13 (ANXA13 gene), isoform b 

FT /protein_id="CAC34622.1" 
FT /translation="MGNRHSQSYTLSEGSQQLPKGDSQPSTVVQPLSHPSRNGEPEAPQ 
FT PAKASSPQGFDVDRDAKKLNKACKGMGTNEAAIIEILSGRTSDERQQIKQKYKATYGKE 
FT LEEVLKSELSGNFEKTALALLDRPSEYAARQLQKAMKGLGTDESVLIEVLCTRTNKEII 
FT AIKEAYQRLFDRSLESDVKGDTSGNLKKILVSLLQANRNEGDDVDKDLAGQDAKDLYDA 
FT GEGRWGTDELAFNEVLAKRSYKQLRATFQAYQILIGKDIEEAIEEETSGDLQKAYLTLV 
FT RCAQDCEDYFAERLYKSMKGAGTDEETLIRIVVTRAEVDLQGIKAKFQEKYQKSLSDMV 
FT RSDTSGDFRKLLVALLH" 
FT exon 84..206 
FT /gene="ANXA13" 

SQ Sequence 1588 BP; 484 A; 351 C; 410 G; 343 T; 0 other; 

gtaaactttg cctgtaggag gactgatctc ttaatgaaat acagaaaaac catctcagaa 60 

aaaggaaaat gggcaatcgt catagccagt cgtacaccct ctcagaaggc agtcaacagt 120 

tgcctaaagg ggactcccaa ccctcgacag tcgtgcagcc tctcagccac ccatcacgga 180 

atggagagcc agaggcccca cagcctgcta aagcgagcag tcctcagggt tttgatgtgg 240 

atcgagatgc caaaaagctg aacaaagcct gcaaaggaat ggggaccaat gaagcagcca 300 

tcattgaaat cttatcgggc aggacatcag atgagaggca acaaatcaag caaaagtaca 360 

aggcaacgta cggcaaggag ctggaggaag tactcaagag tgagctgagt ggaaacttcg 420 

agaagacagc gttggccctt ctggaccgtc ccagcgagta cgccgcccgg cagctgcaga 480 

aggctatgaa gggtctgggc acagatgagt ccgtcctcat tgaggtcctg tgcacgagga 540 

ccaataagga aatcatcgcc attaaagagg cctaccaaag gctatttgat aggagcctcg 600 

aatcagatgt caaaggtgat acaagtggaa acctaaaaaa aatcctggtg tctctgctgc 660 

aggctaatcg caatgaagga gatgacgtgg acaaagatct agctggtcag gatgccaaag 720 

atctgtatga tgcaggggaa ggccgctggg gcactgatga gcttgcgttc aatgaagtcc 780 

tggccaagag gagctacaag cagttacgag ccacctttca agcctatcaa attctcattg 840 

gcaaagacat agaagaagcc attgaagaag aaacatcagg cgacttgcag aaggcctatt 900 

taactctcgt gagatgtgcc caggattgtg aggactattt tgctgaacgt ctgtacaagt 960 

cgatgaaggg tgcggggacc gatgaggaga cgttgattcg catagtcgtg accagggccg 1020 

aggtggacct tcaggggatc aaagcaaagt tccaagagaa gtatcagaag tctctctctg 1080 

acatggttcg ctcagatacc tccggggact tccggaaact gctagtagcc ctcttgcact 1140 

gagccaagcc agggcaatag gaacacaggg tggaaccacc tttgtcaaga gcacattcca 1200 

aatcaaactt gcaaatgaga ctcccgcacg aaaaccctta agagtcccgg attactttct 1260 

tggcagctta agtggcgcag ccaggccaag ctgtgtaagt taagggcagt aacgttaaga 1320 

tgcgtgggca gggcaccttg aactctggct tagcaagcat ctaggctgcc tcttcacttt 1380 

cttttagcat ggtaactgga tgttttctaa acactaatga aatcagcagt tgatgaaaaa 1440 

actatgcatt tgtaatggca catttagaag gatatgcatc acacaagtaa ggtacaggaa 1500 

agacaaaatt aaacaattta ttaattttcc ttctgtgtgt tcaatttgaa agcctcattg 1560 
ttaattaaag ttgtggatta tgcctcta 




DE Homo sapiens serine protease inhibitor, Kazal type 1, mRNA (cDNA clone 



Sequence 362 BP; 121 A; 74 C; 75 G; 92 T; 0 other; 
cgcagaactt cagccatgaa ggtaacaggc atctttcttc tcagtgcctt ggccctgttg 60 
agtctatctg gtaacactgg agctgactcc ctgggaagag aggccaaatg ttacaatgaa 120 
cttaatggat gcaccaagat atatgaccct gtctgtggga ctgatggaaa tacttatccc 180 
aatgaatgcg tgttatgttt tgaaaatcgg aaacgccaga cttctatcct cattcaaaaa 240 
tctgggcctt gctgagaacc aaggttttga aatcccatca ggtcaccgcg aggcctgact 300 
ggccttattg ttgaataaat gtatctgaat atcaaaaaaa aaaaaaaaaa aaaaaaaaaa 360 



DE B cell linker protein mRNA, alternatively spliced, complete cds. 

T / translations "ITOKLNKITVPASQKI^QIjQKMVHDIKNNEGGIMNKIKKLKVKAPP 

Z, SVPRRDYASESPADEEEQWSDDFDSDYENPDEHSDSEMYVMPAEENADDSYEPPPVEQE 

T TRPVHPALPPARGEYIDNRSSQRHSPPFSKTLPSKPSWPSEKARLTSTLPAIiTAIjQKPQ 

T VPPKPKGLLEDEADYWPVEDNDENYIHPTESSSPPPEKAPMVNRSTKPNSSTPASPP6 

T TASGRNSGAWETKSPPPAAPSPLPRAGKKPTTPLKTTPVASQQNASSVCEEKPIPAERH 

T RGSSHRQEAVQSPVFPPAQKQIHQKPIPLPRPTEGGNPTVDGPIjPSPSSITSTISEQEAG 

Z, VLCKPVnTAGACDRKSAEEALHRSNKDGSPLIRKSSGHDSKQPYTLVVFFNKRVYNIPVR 

T F I EATKQYALGRKKNGEEYFGS VAE I IRNHQHS PL VL XDSQNNTKDSTRLKYAVKVS « 

SQ Sequence 1806 BP; 571 A; 448 C; 379 G; 408 T; 0 other; 

[...] 60 
[...] 120 
[...] agcttaataa [...] 180 
[...] atgatattaa [...] 240 
[...] cacctccaaa [...] 300 
[...] 360 
[...] 420 
[...] aggaaaccag gccggttcac 480 
[...] gatcaagcca gaggcattcc 540 
[...] cttcagagaa agcaaggctc 600 
[...] aagtcccacc caaacccaaa 660 
[...] aagataatga tgaaaactat 720 
[...] ctcccatggt gaatagatca 780 
[...] cagcttcagg tcgaaacagt 840 
[...] ccccgttgcc acgggccggg 900 
[...] ctcaacagaa tgcttcaagt 960 
[...] ggtcaagtca cagacaagaa 1020 
[...] tccaccaaaa acccatacct 1080 
[...] ggcccctacc cagcttttca 1140 
[...] gcaagccatg gtatgctgga 1200 
[...] caaacaagga tggatcattt 1260 
[...] atacactagt tgtattcttt 1320 
[...] caacaaaaca atatgccttg 1380 
[...] ctgaaatcat caggaatcat 1440 
[...] caaaagattc caccagactg 1500 
[...] tcaataccat tgcttcagac 1560 
[...] cttcatattt tggattatga 1620 
[...] agtggtcatc catttctttt 1680 
[...] gaactgttaa tatctggtga 1740 
[...] ttttattttt aaaaaaaaaa 1800 



Homo sapiens cDNA FLJ12768 fis, clone NT2RP2001576, weakly similar to 
HYPOTHETICAL 62.2 KD PROTEIN C4G8.12C IN CHROMOSOME I. 

/protein_id="BAB14263.1" 

/translation="MQICGSSVASVAAGTSFQVLGPVCWQQLDLKMAVRVLWGGLSLLR 
VLWCLLPQTGYVHPDEFFQSPEVMAEDILGVQAARPWEFYPSSSCRSVLFPLLISGSTF 
WLLRLWEELGPWPGLVSGYALLVGPRLLLTALSFALDGAVYHLAPPMGADRWNALALLS 
GSYVTLVFYTRTFSNTIEGLLFTWLLVLVSSHVTWGPTRKEPAPGPRWRSWLLGGIVAA 
GFFNRPTFLAFAVVPLYLWGTRGATNPGLKSLTREALVLLPGATLTAAVFVATDSWYFS 
SPATSRNLVLTPVNFLHYNLNPQNLARHGTHARLTHLAVNGFLLFGVLHAQALQAAWQQ 
LQVGLQASAQMGLLRALGARSLLSSPRSYLLLLYFMPLALLSAFSHQEARFLIPLLVPL 
VLLCSPQTQPVPWKGTVVLFNALGALLFGCLHQGGLVPGLEYLEQVVHAPVLPSTPTHY 
TLLFTHTYMPPRHLLHLPGLGAPVEVVDMGGTEDWALCQTLKSFTRQPACQVAGGPWLC 
RLFVVTPGTTRRAVEKCSFPFKNETLLFPHLTLEDPPALSSLLSGAWRDHLSLHIVELG 
EET" 



Sequence 2687 BP; 454 A; 883 C; 733 G; 617 T; 0 other; 
agtctccgcg ctgctgaggc gcgcccggcc gctcccacgg cctcccctcc gccctgcggt 60 
cccgccgcct ccggggcctc ctgggaccct ggccctcgcc gggcaggacg ccgccagcgc 120 
tgaaggcgca gcccggaggc cgcgcggatg cagatctgtg gatccagcgt agcatctgta 180 
gcagctggga catcattcca ggttttgggc ccggtgtgtt ggcaacaact ggatctgaag 240 
atggcagtca gggtgctttg gggtggtctc agcctgctcc gagtgctgtg gtgtctcctt 300 
ccgcagacgg gctatgtgca cccagatgag ttcttccagt cccctgaggt gatggcagag 360 
gacatcctgg gcgttcaggc cgcgcggccc tgggagtttt accccagcag ctcctgccgc 420 
tcggtgctct tccccctgct gatctctggt tccaccttct ggctgctcag gctctgggag 480 
gagctggggc cgtggcctgg cctggtgagc ggctatgcgc tgctggtggg gcctcgactc 540 
ctcctcactg ccctttcctt tgctctggac ggggccgtgt accacctggc cccgccgatg 600 
ggggcggatc gctggaacgc cctggccctg ctgtctggtt cctacgtcac cctggtcttc 660 
tacacaagga ccttctccaa caccattgag ggactcctct tcacgtggct gctggtgctg 720 
gtatcctccc atgtaacgtg gggccctaca cgcaaggagc cggcgccggg tccacggtgg 780 
cgcagctggc ttcttggagg cattgtggct gctggcttct tcaaccggcc cacctttctg 840 
gcctttgctg tggtccccct ctacctctgg ggcactcgtg gagccacaaa ccctggtttg 900 
aagtctctga cccgggaggc cctggtgctg ctccctgggg cgaccctcac agcagcggtg 960 
tttgtggcca cggacagctg gtatttctcc agccccgcga catccaggaa ccttgtcctg 1020 
acacctgtca acttcctgca ctacaacctg aatccccaaa acctggcgag acatggcacg 1080 
cacgcgcggc tcactcacct ggcagtcaac ggcttcctgc tcttcggggt gctgcatgcc 1140 
caggccctgc aggctgcgtg gcaacagctg caagtcggcc tccaggcctc tgcacaaatg 1200 
ggcctcctga gggcactggg tgcccggagc ctgctgtcca gccccaggtc ctatctcctt 1260 
ctcctctact tcatgcctct ggccctgcta tctgccttta gccaccagga ggctcggttc 1320 
ctgattcccc tcctggtccc cctggtcctg ctttgtagtc cacagacgca gcctgtgcct 1380 
tggaagggca ctgtggtcct cttcaacgcc ctcggtgccc tcctcttcgg ctgcctgcat 1440 
caggggggcc tggtgcctgg cctggagtac ctggagcagg tggtccatgc ccctgtgctc 1500 
ccaagcacac ccacccacta cacactcctc ttcactcaca cctacatgcc cccccggcac 1560 
ctcctacacc tcccaggcct gggggcacca gtggaggtgg tggacatggg ggggactgag 1620 
gactgggccc tgtgccaaac cctgaaaagc ttcaccagac aaccagcctg ccaagtggct 1680 
ggtgggccat ggctctgccg cctctttgtg gtaacccctg gcaccaccag gcgtgccgtg 1740 
gagaagtgca gcttcccctt caagaatgaa acacttttat ttccccatct gaccctggag 1800 
gatccaccag ccctgtcctc cttgctgagt ggggcttgga gggaccacct cagtcttcac 1860 
attgtggagc tgggggaaga aacctgacaa tatgacagag cacccactgc ccaagactca 1920 
gccatagaag atgccgcccc accttctact tgggtagctg ggctgggacg ctgggacagg 1980 
accccgtcct ccttcatgac tcccactgct gcctctcctg ggcatggctg ttagctgttc 2040 
tgccttctgg gtgagctggc actcttctcc ctgagaccaa agatttgacc tgtcggtctt 2100 
gatgtcaagg tccccaaaga ccaggttaag tgacgacacc tgtctgttcc tgccctgttg 2160 
cttccagcca ctgtgatgtt tgaaatatgt gatagtacct ggttgtgaaa aaagacaatg 2220 
aactgctagt gacattcctc aatgacctct cccaaacctc ccatgatgcc ttacccttgc 2280 
tgtcatgaca accctctggc ttcctaagac ccatctgcct atcgaaatat gtgcaagtca 2340 
gtgagacgaa gtatagagaa caggtggccc agatccaggg gacccaactt ctggcccctt 2400 



ggtctgtcac ctcctcgctg tgtgatcttg agaaagctcc ttccactcac ccaccccact 2460 

tcccagtctg ttgggatcag aggaactttg aggtgtctgc cggctaacat tgtgtcattc 2520 

ctggagtcca cagtacacgt cccctgcctc aacaggcaca gctctcacaa agctcttcaa 2580 

gcatggaagt gggagttgtg ttgtacttca tggcactctg atgcctgctg tctcagtgtt 2640 
tggttattat gcaaacaagt aatgtttgaa atatataata gcactgg 



Homo sapiens glycine amidinotransferase (L-arginine:glycine 
amidinotransferase), mRNA (cDNA clone MGC:1744 IMAGE:3010128), complete 

/protein_id="AAH04141.1" 

/translation="MLRVRCLRGGSRGAEAVHYIGSRLGRTLTGWVQRTFQSTQAATAS 
SRNSCAADDKATEPLPKDCPVSSYNEWDPLEEVIVGRAENACVPPFTIEVKANTYEKYW 
PFYQKQGGHYFPKDHLKKAVAEIEEMCNILKTEGVTVRRPDPIDWSLKYKTPDFESTGL 
YSAMPRDILIVVGNEIIEAPMAWRSRFFEYRAYRSIIKDYFHRGAKWTTAPKPTMADEL 
YNQDYPIHSVEDRHKLAAQGKFVTTEFEPCFDAADFIRAGRDIFAQRSQVTNYLGIEWM 
RRHLAPDYRVHIISFKDPNPMHIDATFNIIGPGIVLSNPDRPCHQIDLFKKAGWTIITP 
PTPIIPDDHPLWMSSKWLSMNVLMLDEKRVMVDANEVPIQKMFEKLGITTIKVNIRNAN 
SLGGGFHCWTCDVRRRGTLQSYLD" 

Sequence 2342 BP; 690 A; 490 C; 480 G; 682 T; 0 other; 

cgggaaggct tggaccgacg cggcccagag gccaggaaca ttccgcgcgt ggaccagccg 60 

ggccagggcg atgctgcggg tgcggtgtct gcgcggcggg agccgcggcg ccgaggcggt 120 

gcactacatc ggatctcggc ttggacgaac cttgacagga tgggtgcagc gaactttcca 180 

gagcacccag gcagctacgg cttcctcccg gaactcctgt gcagctgacg acaaagccac 240 

tgagcctctg cccaaggact gccctgtctc ttcttacaac gaatgggacc ccttagagga 300 

agtgatagtg ggcagagcag aaaacgcctg tgttccaccg ttcaccatcg aggtgaaggc 360 

caacacatat gaaaagtact ggccatttta ccagaagcaa ggagggcatt attttcccaa 420 

agatcatttg aaaaaggctg ttgctgaaat tgaagaaatg tgcaatattt taaaaacgga 480 

aggagtgaca gtaaggaggc ctgaccccat tgactggtca ttgaagtata aaactcctga 540 

ttttgagtct acgggtttat acagtgcaat gcctcgagac atcctgatag ttgtgggcaa 600 

tgagattatc gaggctccca tggcatggcg ttcacgcttc tttgagtacc gagcgtacag 660 

gtcaattatc aaagactact tccaccgtgg cgccaagtgg acaacagctc ctaagcccac 720 

aatggctgat gagctttata accaggatta tcccatccac tctgtagaag acagacacaa 780 

attggctgct cagggaaaat ttgtgacaac tgagtttgag ccatgctttg atgctgctga 840 

cttcattcga gctggaagag atatttttgc acagagaagc caggttacaa actacctagg 900 

cattgaatgg atgcgtaggc atcttgctcc agactacaga gtgcatatca tctcctttaa 960 

agatcccaat cccatgcata ttgatgctac cttcaacatc attggacctg gtattgtgct 1020 

ttccaaccct gaccgaccat gtcaccagat tgatcttttc aagaaagcag gatggactat 1080 

cattactcct ccaacaccaa tcatcccaga cgatcatcca ctctggatgt catccaaatg 1140 

gctttccatg aatgtcttaa tgctagatga aaaacgtgtt atggtggatg ccaatgaagt 1200 

tccaattcaa aagatgtttg aaaagctggg tatcactacc attaaagtta acattcgtaa 1260 

tgccaattcc ctgggaggag gcttccattg ctggacctgc gatgtccggc gccgaggcac 1320 

cttacagtcc tacttggact gaacaggcct gatggagctt gtggctggcc tcagatacac 1380 

ctaagaagct taggggcaag gttcattctc ctgctttaaa aagtgcatga actgtagtgc 1440 

tttaaacaat catctcctta acaggggtcg taagcctggt ttgcttctat tacttttctt 1500 

tgacataaag aaaataactt ctgctaggta ttactctcta ctcctaaagt tatttactat 1560 

ttggcttcaa gtataaaatt ttggtgaatg tgtaccaaga aaaaattagt cacctgagta 1620 

acttggccac taataattaa ccatctacct ctgtttttaa ttttctttcc aaaaggcagc 1680 

ttgaaatgtt ggtcctaatc ttaatttttt ttcctcttct atagacttga gaatgttttt 1740 

ctctaaatga gagaaagact tagaatgtac acagatccaa aatagaatca gattatctct 1800 

ttttttctaa aggagagaaa gacttagaac atacacagat cctaagtaga accaggtaat 1860 

tgtctctttt tctaataagg aatttgggta atttttaatt ttttgttttt taaaaaataa 1920 

cctagactat gcaaaacatc aaagtgaatt ttccatgaat gtttttaata ttctcatctc 1980 

aacattgtga tatatgctac taaaaacctt ttcatataca tcttacctca tttcaagtga 2040 

attattttaa tctttttctc tctttccaaa aatttaggaa tgtttagtgt aattggattt 2100 

cgctatcagt tcccatcctt aagttttgat attcaatatc tgatagatac actgcatctt 2160 

tggtcatcta agatttgttt acaaatgtgc aaattattta gagcatagac tttataagca 2220 

ttaaaaaaaa ctaatggagg taaaacctaa atgcgatgtg aaataatttt agtgttgata 2280 

ccgtatgtgt atttttattc taataaactt ttgtgttcca gaaaaaaaaa aaaaaaaaaa 2340 

aa 



DE Homo sapiens cDNA FLJ10143 fis, clone HEMBA1003281, weakly similar to 
DE POLIOVIRUS RECEPTOR PRECURSOR. 

/translation="MGTQEGWCLLLCLALSGAAETKPHPAEGQWRAVDVV^ 
AHRGALASSEDRARASLVLKQVPVLDDGSLEDFTDFQGGTLAQDDPPIIFEASVDLVQI 
PQAEALLHADCSGKEVTCEISRYFLQMTETTVKTAAWFMANVQVSGRGPSISLVMKTPR 
VAKNEALWHPTLNLPLSPQGTVRTAVEFQVMTQTQSL^ 
ISVEWRLQHKGRGQLVYSWTAGQGQAVRKGATLEPAQLGMARDASLTLPGLTIQDEGTY 
ICQITTSLYRAQQIIQLNIQASPKVRLSLANEALLPTLICDIAGYYPL^ 
GGSPAQVSGASFSSLRQSVAGTYSISSSLTAEPGSAGATYTCQVTHISLEEPLGASTQV 
VPPERRTALGVIFASSLFLLALMFLGLQRRQAPTGLGLLQAERWETTSCADTQSSHLHE 
DRTARVSQPS" 



Sequence 1694 BP; 365 A; 514 C; 488 G; 327 T; 0 other; 



[...] 60 
[...] 120 
[...] 180 
[...] 240 
[...] ctctcgccag 300 
[...] tggacgatgg 360 
[...] acccacctat 420 
[...] tgctccatgc 480 
[...] agatgacaga 540 
[...] gacggggacc 600 
[...] tctggcaccc 660 
[...] agttccaggt 720 
[...] tggactgtgg 780 
[...] agcacaaggg 840 
[...] ggaagggcgc 900 
[...] tgcccggcct 960 
[...] accgagctca 1020 
[...] tggcaaacga 1080 
[...] atgtggtggt 1140 
[...] cctccttctc 1200 
[...] ccgcagaacc 1260 
[...] aggagcccct 1320 
[...] tcatctttgc 1380 
[...] aagcacctac 1440 
[...] acacacagag 1500 
[...] ctaaagcgac 1560 
[...] tccaacccaa 1620 
[...] tcagaataca 1680 
tgacattggt aaat 



DE Homo sapiens leucine aminopeptidase 3, mRNA (cDNA clone IMAGE:2821948), 
partial cds. 

/translation="LAVRRFGSRSLSTADMTKGLVLGIYSKEKEDDVPQFTSAGENFDK 
LLAGKLRETLNISGPPLKAGKTRTFYGLHQDFPSVVLVGLGKKAAGIDEQENWHEGKEN 
IRAAVAAGCRQIQDLELSSVEVDPCGDAQAAAEGAVLGLYEYDDLKQKKKMAVSAKLYG 
SGDQEAWQKGVLFASGQNLARQLMETPANEMTPTRFAEIIEKNLKSASSKTEVHIRPKS 
WIEEQAMGSFLSVAKGSDEPPVFLEIHYKGSPNANEPPLVFVGKGITFDSGGISIKASA 
NMDLMRADMGGAATICSAIVSAAKLNLPINIIGLAPLCENMPSGKANKPGDVVRAKNGK 
TIQVDNTDAEGRLILADALCYAHTFNPKVILNAATLTGAMDVALGSGATGVFTNSSWLW 
NKLFEASIETGDRVWRMPLFEHYTRQVVDCQLADVNNIGKYRSAGACTAAAFLKEFVTH 
PKWAHLDIAGVMTNKDEVPYLRKGMTGRPTRTLIEFLLRFSQDNA" 



Sequence 1938 BP; 603 A; 386 C; 470 G; 479 T; 0 other; 
gtctggccgt gagacgtttc gggagccgga gtctctccac cgcagacatg acgaagggcc 60 
ttgttttagg aatctattcc aaagaaaaag aagatgatgt gccacagttc acaagtgcag 120 
gagagaattt tgataaattg ttagctggaa agctgagaga gactttgaac atatctggac 180 
cacctctgaa ggcagggaag actcgaacct tttatggtct gcatcaggac ttccccagcg 240 
tggtgctagt tggcctcggc aaaaaggcag ctggaatcga cgaacaggaa aactggcatg 300 
aaggcaaaga aaacatcaga gctgctgttg cagcggggtg caggcagatt caagacctgg 360 
agctctcgtc tgtggaggtg gatccctgtg gagacgctca ggctgctgcg gagggagcgg 420 
tgcttggtct ctatgaatac gatgacctaa agcaaaaaaa gaagatggct gtgtcggcaa 480 
agctctatgg aagtggggat caggaggcct ggcagaaagg agtcctgttt gcttctgggc 540 
agaacttggc acgccaattg atggagacgc cagccaatga gatgacgcca accagatttg 600 
ccgaaattat tgagaagaat ctcaaaagtg ctagtagtaa aaccgaggtc catatcagac 660 
ccaagtcttg gattgaggaa caggcaatgg gatcattcct cagtgtggcc aaaggatctg 720 
acgagccccc agtcttcttg gaaattcact acaaaggcag ccccaatgca aacgaaccac 780 
ccctggtgtt tgttgggaaa ggaattacct ttgacagtgg tggtatctcc atcaaggctt 840 
ctgcaaatat ggacctcatg agggctgaca tgggaggagc tgcaactata tgctcagcca 900 
tcgtgtctgc tgcaaagctt aatttgccca ttaatattat aggtctggcc cctctttgtg 960 
aaaatatgcc cagcggcaag gccaacaagc cgggggatgt tgttagagcc aaaaacggga 1020 
agaccatcca ggttgataac actgatgctg aggggaggct catactggct gatgcgctct 1080 
gttacgcaca cacgtttaac ccgaaggtca tcctcaatgc cgccacctta acaggtgcca 1140 
tggatgtagc tttgggatca ggtgccactg gggtctttac caattcatcc tggctctgga 1200 
acaaactctt cgaggccagc attgaaacag gggaccgtgt ctggaggatg cctctcttcg 1260 
aacattatac aagacaggtt gtagattgcc agcttgctga tgttaacaac attggaaaat 1320 
acagatctgc aggagcatgt acagctgcag cattcctgaa agaattcgta actcatccta 1380 
agtgggcaca tttagacata gcaggcgtga tgaccaacaa agatgaagtt ccctatctac 1440 
ggaaaggcat gactgggagg cccacaagga ctctcattga gttcttactt cgtttcagtc 1500 
aagacaatgc ttagttcaga tactcaaaaa tgtcttcact ctgtcttaaa ttggacagtt 1560 
gaacttaaaa ggtttttgaa taaatggatg aaaatctttt aacggagaca aaggatggta 1620 
tttaaaaatg tagaacacaa tgaaatttgt atgccttgat ttttttttca tttcacacaa 1680 
agatttataa aggtaaagtt aatatcttac ttgataagga tttttaagat actctataaa 1740 
tgattaaaat ttttagaact tcctaatcac ttttcagagt atatgttttt cattgagaag 1800 
caaaattgta actcagattt gtgatgctag gaacatgagc aaactgaaaa ttactatgca 1860 
cttgtcagaa acaataaatg caacttgttg tgctcaaaaa aaaaaaaaaa aaaaaaaaaa 1920 



DE Homo sapiens mRNA for protein phosphatase 4 regulatory subunit 2 (PPP4R2 
DE gene) 



/ translation= n MCQAPCWRAGGSGLGRCSLCRSCSLARFPRLPSFPPPGRLRAGVC 
AREGEGVGGVGGGVPVPKRPAEGGGGCEGLREAT©VERLQEALK 

QFLCHVAKTGETMI QWS QFKGYF I FKLEKVMDDFRTS APEPRGP PNPNVE YI PFDEMKE 

RILKIVTGFNGIPFTIQRIiCELLTDPRRNYTGTDKFLRGVEKNVMVVSCVYPSSEra 

NSLNRI^GVMFPGNAPSYTERSNINGPGTPRPRmPKVSLSAPMTTN 



Sequence 2 
actgtacaaa 
caatagcatg 
cctcaccatg 
tcagaggcca 
cacctgctgc 
gtatcagcac 
actctcagat 
tgccaggcgc 
tcttgctctc 
gccggagtgt 
cccaaaagac 
gagaggctcc 
gtcctggatc 
caatttaaag 
gctcctgagc 
aaggaaagaa 
ctatgtgaat 
ggagtagaaa 
tccaatagtt 
gagaggtcta 
ctgtcagccc 
ttgcagcaaa 
tcagtgagcc 
gaggtaaaaa 
acttccagcg 
gataaagaca 
gatgaagagg 
aatcaagaaa 
aatcaaatgg 
agtgaaaacg 
agtaggaacc 
tgatgacgaa 
tacatgcagt 
aagtgcacct 
tcctgaaag 



TTS SE I S S VMVGETEAS S S SQDKDKDSRCTRQHCTEEDEEEDEEEEEE S FMTSREM I PE 
RJCNQEKESDDALTVNEETSEENNQMEESDVSQAEKDLLiHSEGSENEGPESKWFF" 

Sequence 2049 BP; 651 A; 409 C; 506 G; 483 T; 0 other;
tgctttattt ctattcaata tttagaagac agttataaac 
gtggcagatg aacatcagga aggaacatcc atgagcttcc 
gatacgcttg tgatcaaggg cctggtctcc cctcaagaca 
caccatccta gcagtggagc agtaccagct gggacagggt 
atcaccaggc tgggtgaacg gacacaattg ccagaactca 
cgaaacctca caggaaaaat ggtaagttct aagtttctcc 
taatctctgt catccatcgc ttctccaaga aatgactttt 
catgttggag ggctggtggt agcggcttgg ggaggtgctc 
tcgcacgctt cccccggctc ccttcgtttc ccccccccgg 
gtgcgaggga gggggagggc gtcggggggg tggggggagg 
ccgcggaggg aggcggaggc tgtgagggac tccgggaagc 
aggaggcgct gaaagatttt gagaagaggg ggaaaaagga 
agtttctttg tcatgtagcc aagactggag aaacaatgat 
gctattttat tttcaaactg gagaaagtga tggatgattt 
caagaggtcc tcccaaccct aatgtcgaat atattccctt 
tactgaaaat tgtcactgga tttaatggta tcccttttac 
tgttaacaga tccaaggaga aactatacag gaacagacaa 
agaacgtgat ggttgttagc tgtgtttatc cttcttcaga 
taaatcgaat gaatggtgtg atgtttcctg gaaatgcacc 
atataaatgg gcctgggaca cccaggccac gtaatcgacc 
ccatgacaac aaatgggtgg cctgagagca cagacagcaa 
atgaggagaa aactcacagt gactcttcga catctgaatc 
ctttgagaaa taaacatcca gatgaagatg ctgtggaagc 
gactcaggtt tgacaaagaa ggtgaagtca gagaaacagc 
aaatttcttc agttatggta ggagaaacag aagcatcatc 
aagatagccg ttgtacccgg cagcactgta cagaagagga 
aagaagaaga gtcttttatg acatcaagag aaatgatccc 
aagaatctga tgatgcctta actgtgaatg aagagacttc 
aggaatctga tgtgtctcaa gctgagaaag atttgctaca 
aaggccctga aagtaagtgg ttcttctgac tgccgtgaaa 
aattcccagt aaaactggaa agaatctttc cagaatcatc 
gccacagaag tcaccgatga accactggaa caagactatt 
attttacaca cagttctggt tttaacactg tataaaactt 
ttagttttac aagtaaagca ggttgtaaaa taaagtactt 





c ft 


» <— »— ^— c*. y y cx d 


inn 

JL<, U 


»-yy L.L-cteciy ci 


180


tttLCty Lya 


240 


tagaotagoo 


300


aULaaLay La 


360


t. ayyyi-yaug 


420 


A /~« t - f - /~T+- r-*/~rrr 

civ i— y i_- cjy y 


480 




540 




600


catggacgtc 


660 


agtttgtcct 


720 


tcagtggtcc 


780 


cagaacttca 


840 


tgatgaaatg 


900 


tattcagcga 


960 


atttctcaga 


1020 


gagaaacaat 


1080 


aagctatact 


1140 


aaaggtttct 


1200 


agaggcaaat 


1260 


agaagtttcc 


1320 


tgaggggcat 


1380 


cagtcaaacg 


1440 


ttcatctcag 


1500 


tgaagaagag 


1560 


agaaagaaaa 


1620 


tgaagaaaat 


1680 


ttctgaaggt 


1740 


cagaaaaatt 


1800 


ccatggataa 


1860 


tagaaacatt 


1920 


ttatgtaaaa 


1980 


tatggataat 


2040 




Human mRNA for (2'-5') oligo A synthetase E (1,6 Kb RNA)

/translation="MMDLRNTPAKSLDKFIEDYLLPDTCFRMQIDHAIDIICGFLKERC
FRGSSYPVCVSKVVKGGSSGKGTTLRGRSDADLVVFLSPLTTFQDQLNRRGEFIQEIRR
QLEACQRERALSVKFEVQAPRWGNPRALSFVLSSLQLGEGVEFDVLPAFDALGQLTGSY
KPNPQIYVKLIEECTDLQKEGEFSTCFTELQRDFLKQRPTKLKSLIRLVKHWYQNCKKK
LGKLPPQYALELLTVYAWERGSMKTHFNTAQGFRTVLELVINYQQLCIYWTKYYDFKNP
IIEKYLRRQLTKPRPVILDPADPTGNLGGGDPKGWRQLAQEAEAWLNYPCFKNWDGSPV
SSWILLVRPPASSLPFIPAPLHEA"



Sequence 1322 BP; 334 A; 353 C; 320 G; 315 T; 0 other; 
gaggcagttc tgttgccact ctctctcctg tcaatgatgg atctcagaaa taccccagcc 
aaatctctgg acaagttcat tgaagactat ctcttgccag acacgtgttt ccgcatgcaa 
atcgaccatg ccattgacat catctgtggg ttcctgaagg aaaggtgctt ccgaggtagc 
tcctaccctg tgtgtgtgtc caaggtggta aagggtggct cctcaggcaa gggcaccacc 
ctcagaggcc gatctgacgc tgacctggtt gtcttcctca gtcctctcac cacttttcag 
gatcagttaa atcgccgggg agagttcatc caggaaatta ggagacagct ggaagcctgt 
caaagagaga gagcactttc cgtgaagttt gaggtccagg ctccacgctg gggcaacccc 
cgtgcgctca gcttcgtact gagttcgctc cagctcgggg agggggtgga gttcgatgtg 
ctgcctgcct ttgatgccct gggtcagttg actggcagct ataaacctaa cccccaaatc 
tatgtcaagc tcatcgagga gtgcaccgac ctgcagaaag agggcgagtt ctccacctgc 
ttcacagaac tacagagaga cttcctgaag cagcgcccca ccaagctcaa gagcctcatc 
cgcctagtca agcactggta ccaaaattgt aagaagaagc ttgggaagct gccacctcag 
tatgccctgg agctcctgac ggtctatgct tgggagcgag ggagcatgaa aacacatttc 
aacacagccc aaggatttcg gacggtcttg gaattagtca taaactacca gcaactctgc 
atctactgga caaagtatta tgactttaaa aaccccatta ttgaaaagta cctgagaagg 
cagctcacga aacccaggcc tgtgatcctg gacccggcgg accctacagg aaacttgggt 
ggtggagacc caaagggttg gaggcagctg gcacaagagg ctgaggcctg gctgaattac 
ccatgcttta agaattggga tgggtcccca gtgagctcct ggattctgct ggtgagacct 
cctgcttcct ccctgccatt catccctgcc cctctccatg aagcttgaga catatagctg 
gagaccattc tttccaaaga acttacctct tgccaaaggc catttatatt catatagtga 
caggctgtgc tccatatttt acagtcattt tggtcacaat cgagggtttc tggaattttc 
acatcccttg tccagaattc attcccctaa gagtaataat aaataatctc taacaccaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



Homo sapiens A-kinase anchoring protein 18 beta mRNA, complete cds.






/translation="MGQLCCFPFSRDEGKISELESSSSAVLQRYSKDIPSWSSGEKNGG
EPDDAELVRLSKRLVENAVLKAVQQYLEETQNKNKPGEGSSVKTEAADQNGNDNENNRK"



Sequence 463 BP; 139 A; 106 C; 132 G; 86 T; 0 other; 
gctcgcagac tgtgctataa actgcaattt ctatttgggg tcctcacgga gaagaacacc 60
aggaaagaca gacaggacca gtgccatggg ccagctttgc tgctttcctt tctcaagaga 120
tgaaggaaaa atcagtgagt tggaaagctc gtcctctgca gtcctacaaa gatacagcaa 180
ggatataccc agttggtcaa gtggtgaaaa gaacggaggg gagcccgatg acgctgaact 240
agtaaggctc agtaagaggc tggtggagaa cgcggtgctc aaggctgtcc agcagtatct 300
ggaggaaaca cagaataaaa acaagccggg ggaggggagc tctgtgaaaa ccgaagcagc 360
tgatcagaat ggcaatgaca atgagaacaa caggaaatga gcccggaacg caggccccca 420
tgtctctgtg caaagcctcc ctgcttccct ctgctgagtc tag 463




Homo sapiens peptidyl prolyl isomerase H (cyclophilin H), mRNA (cDNA clone

/translation="MAVANSSPVNPVVFFDVSIGGQEVGRMKIELFADVVPKTAENFRQ
FCTGEFRKDGVPIGYKGSTFHRVIKDFMIQGGDFVNGDGTGVASIYRGPFADENFKLRH
SAPGLLSMANSGPSTNGCQFFITCSKCDWLDGKHVVFGKIIDGLLVMRKIENVPTGPNN
KPKLPVVISQCGEM"

Sequence 765 BP; 199 A; 156 C; 200 G; 210 T; 0 other; 
cttctgcttc cgggtcggag ccatggcggt ggcaaattca agtcctgtta accccgtggt 
gttctttgat gtcagtattg gcggtcagga agttggccgc atgaagatcg agctctttgc 
agacgttgtg cctaagacgg ccgagaactt taggcagttc tgcaccggag aattcaggaa 
agatggggtt ccaataggat acaaaggaag caccttccac agggtcataa aggatttcat 
gattcagggt ggagattttg ttaatggaga tggtactgga gtcgccagta tttaccgggg 
gccatttgca gatgaaaatt ttaaacttag acactcagct ccaggcctgc tttccatggc 
gaacagtggt ccaagtacaa atggctgtca gttctttatc acctgctcta agtgcgattg 
gctggatggg aagcatgtgg tgtttggaaa aatcatcgat ggacttctag tgatgagaaa 
gattgagaat gttcccacag gccccaacaa taagcccaag ctacctgtgg tgatctcgca 
gtgtggggag atgtagtcca gacaaagact gaatcaggcc ttcccttctt cttggtggtg 
ttcttgagta agataatctg gactggcccc cgtctttgct tccctgcctg ctgctgcccc 
atttgatcaa gagaccatgg aagtgtcaga gattcagaat ccaagattgt ctttaagttt 
aactgtaa ataaagtttt tttgtatgcg taaaaaaaaa aaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 






Homo sapiens mRNA; cDNA DKFZp564C0362 (from clone DKFZp564C0362); complete
cds



polyA signal 1685..1690
polyA site 1711

/translation="MYGKGKSNSSAVPSDSQAREKLALYVYEYLLHVGAQKSAQTFLSE
IRWEKNITLGEPPGFLHSWWCVFWDLYCAAPERRETCEHSSEAKAFHDYSAAAAPSPVL
GNIPPGDGMPVGPVPPGFFQPFMSPRYPGGPRPPLRIPNQALGGVPGSQPLLPRGMDPT
RQQGHPNMGGPMQRMTPPRGMVPLGPQNYGGAMRPPLNALGGPGMPGMNMGPGGGRPWP
NPTNANSIPYSSASPGNYVGPPGGGGPPGTPIMPSPADSTNSGDNMYTLMNAVPPGPNR
PNFPMGPGSDGPMGGLGGMESHHMNGSLGSGDMDSISKNSPNNMSLSNQPGTPRDDGEM
GGNFLNPFQSESYSPSMTMSV"



Sequence 1731 BP; 513 A; 385 C; 392 G; 441 T; 0 other; 
gggggaggct gtgatgggtt gacaggtgcg tgacagtggg agctgctctc ggcacaagca 60
tgtacggcaa aggcaagagt aacagcagcg ccgtcccgtc cgacagccag gcccgggaga 120
agttagcact ctacgtatat gaatatctgc tccatgtagg agctcagaaa tcagctcaaa 180
catttttatc agagataaga tgggaaaaaa acatcacatt gggggaacca ccaggattct 240
tacattcttg gtggtgtgta ttttgggatc tctactgtgc agctccagag agacgtgaaa 300
catgtgaaca ctcaagtgaa gcaaaagcct tccatgatta cagtgctgca gcagctccca 360
gtccagtgct aggaaacatt cccccaggag atggcatgcc agtaggtcct gtaccaccag 420
ggttctttca gccttttatg tcacctcggt accctggagg tccaaggccc ccattgagga 480
tacctaatca ggcacttgga ggtgtcccag gaagtcagcc attactcccc agaggaatgg 540
atccaactcg acaacaagga catccaaata tgggtgggcc aatgcagaga atgactcctc 600
caagaggaat ggtgccctta ggaccacaga actatggagg tgcaatgaga cccccactga 660
atgctttagg tggccctgga atgcctggaa tgaacatggg tccaggtggt ggtagacctt 720
ggccaaaccc aacaaatgcc aattcaatac catactcctc agcatctcct gggaattatg 780
taggtcctcc aggaggtgga gggccaccag gaacacccat catgcctagt ccagcagatt 840
caaccaactc tggtgataac atgtatactt taatgaatgc agtacctcct ggacctaaca 900
gacctaattt tccaatgggc cctgggtcag atggtcccat gggtggatta ggaggaatgg 960
agtcacatca catgaatggc tctttaggct caggagatat ggacagtatt tccaagaatt 1020
ctcccaataa tatgagcctg agtaatcaac cgggcactcc aagggatgat ggcgaaatgg 1080
ggggaaattt cttaaatcct tttcagagtg agagttactc ccctagcatg acaatgagcg 1140
tgtgatccat taccaagtct cctcatgaaa accacagtga gtcagccctt cacagaacta 1200
ctacggaaga aaattattca tcacagtgta cagttaaaca aaggaatctc agtcacacca 1260
aaccaacctt ttcatttcct gctctctccc ctcttttgtg aagaaagcgg gtccagatgt 1320
gattcaaaca actgtacgga gtggcatatt agaattgccc taaactgaac tgcaaataat 1380
tatgtgtgta tgtatatgtg tgggaaagag aatgtactgt atatgtgtat gttatacaga 1440
catatacaca tacatacatt gacccacagg acattgtaaa atattatcac atgacatctt 1500
aagtagaaat aagtagggac ttttattcca tccttttttt cacgtttaca ttttaattat 1560
tacaagttgc tcctgccccc tccctgaact attttgtgct gtgtatatca ctgctttata 1620
taagttattt tttaaggtga actcagatgt tatggttttg tatatgtctg caatcatgga 1680
taggaataaa atcgcttatt tgagagcttt caaaaaaaaa aaaaaaaaaa c 1731



Human interferon-induced cellular resistance mediator protein (MxB) mRNA,
complete cds. 

/translation="MSKAHKPWPYRRRSQFSSRKYLKKEMNSFQQQPPPFGTVPPQMMF
PPNWQGAEKDAAFLAKDFNFLTLNNQPPPGNRSQPRAMGPENNLYSQYEQKVRPCIDLI
DSLRALGVEQDLALPAIAVIGDQSSGKSSVLEALSGVALPRGSGIVTRCPLVLKLKKQP
CEAWAGRISYRNTELELQDPGQVEKEIHKAQNVMAGNGRGISHELISLEITSPEVPDLT
IIDLPGITRVAVDNQPRDIGLQIKALIKKYIQRQQTINLVVVPCNVDIATTEALSMAHE
VDPEGDRTIGILTKPDLMDRGTEKSVMNVVRNLTYPLKKGYMIVKCRGQQEITNRLSLA
EATKKEITFFQTHPYFRVLLEEGSATVPRLAERLTTELIMHIQKSLPLLEGQIRESHQK
ATEELRRCGADIPSQEADKMFFLIEKIKMFNQDIEKLVEGEEVVRENETRLYNKIREDF
KNWVGILATNTQKVKNIIHEEVEKYEKQYRGKELLGFVNYKTFEIIVHQYIQQLVEPAL
SMLQKAMEIIQQAFINVAKKHFGEFFNLNQTVQSTIEDIKVKHTAKAENMIQLQFRMEQ
MVFCQDQIYSVVLKKVREEIFNPLGTPSQNMKLNSHFPSNESSVSSFTEIGIHLNAYFL
ETSKRLANQIPFIIQYFMLRENGDSLQKAMMQILQEKNRYSWLLQEQSETATKRRILKE
RIYRLTQARHALCQFSSKEIH"

Sequence 2961 BP; 826 A; 754 C; 721 G; 660 T; 0 other; 
aagagatgat ttctccatcc tgaacgtgca gcgagcttgt caggaagatc ggaggtgcca 
aatagcagag aaagcatccc ccagctctga cagggagaca gcacatgtct aaggcccaca 
agccttggcc ctaccggagg agaagtcaat tttcttctcg aaaatacctg aaaaaagaaa 
tgaattcctt ccagcaacag ccaccgccat tcggcacagt gccaccacaa atgatgtttc 
c?ccaaactg gcagggggca gagaaggacg ctgctttcct cgccaaggac ttcaactttc 
tcactttgaa caatcagcca ccaccaggaa acaggagcca accaagggca atggggcccg 
agaacaacct gtacagccag tacgagcaga aggtgcgccc ctgcattgac ctcatcgact 
ccctgcgggc tctgggtgtg gagcaggacc tggccctgcc agccatcgcc gtcatcgggg 
accagagctc gggcaagagc tctgtgctgg aggcactgtc aggagtcgcg cttcccagag 
gcagcggaat cgtaaccagg tgtccgctgg tgctgaaact gaaaaagcag ccctgtgagg 
catgggccgg aaggatcagc taccggaaca ccgagctaga gcttcaggac cctggccagg 
tggagaaaga gatacacaaa gcccagaacg tcatggccgg gaatggccgg ggcatcagcc 
atgagctcat cagcctggag atcacctccc ctgaggttcc agacctgacc atcattgacc 
ttcccggcat caccagggtg gctgtggaca accagccccg agacatcgga ctgcagatca 
aggctctcat caagaagtac atccagaggc agcagacgat caacttggtg gtggttccct 
gtaacgtgga cattgccacc acggaggcgc tgagcatggc ccatgaggtg gacccggaag 
gggacaggac catcggtatc ctgaccaaac cagatctaat ggacaggggc actgagaaaa 
gcgtcatgaa tgtggtgcgg aacctcacgt accccctcaa gaagggctac atgattgtga 
agtgccgggg ccagcaggag atcacaaaca ggctgagctt ggcagaggca accaagaaag 
aaattacatt ctttcaaaca catccatatt tcagagttct cctggaggag gggtcagcca 
cggttccccg actggcagaa agacttacca ctgaactcat catgcatatc caaaaatcgc 
tcccgttgtt agaaggacaa ataagggaga gccaccagaa ggcgaccgag gagctgcggc 
gttgcggggc tgacatcccc agccaggagg ccgacaagat gttctttcta attgagaaaa 
tcaagatgtt taatcaggac atcgaaaagt tagtagaagg agaagaagtt gtaagggaga 
atgagacccg tttatacaac aaaatcagag aggattttaa aaactgggta ggcatacttg 
caactaatac ccaaaaagtt aaaaatatta tccacgaaga agttgaaaaa tatgaaaagc 
agtatcgagg caaggagctt ctgggatttg tcaactacaa gacatttgag atcatcgtgc 
atcagtacat ccagcagctg gtggagcccg cccttagcat gctccagaaa gccatggaaa 
ttatccagca agctttcatt aacgtggcca aaaaacattt tggcgaattt ttcaacctta 
accaaactgt tcagagcacg attgaagaca taaaagtgaa acacacagca aaggcagaaa 
acatgatcca acttcagttc agaatggagc agatggtttt ttgtcaagat cagatttaca 
gtgttgttct gaagaaagtc cgagaagaga tttttaaccc tctggggacg ccttcacaga 
atatgaagtt gaactctcat tttcccagta atgagtcttc ggtttcctcc tttactgaaa 
taggcatcca cctgaatgcc tacttcttgg aaaccagcaa acgtctcgcc aaccagatcc 
catttataat tcagtatttt atgctccgag agaatggtga ctccttgcag aaagccatga 
tgcagatact acaggaaaaa aatcgctatt cctggctgct tcaagagcag agtgagaccg 
ctaccaagag aagaatcctt aaggagagaa tttaccggct cactcaggcg cgacacgcac 
tctgtcaatt ctccagcaaa gagatccact gaagggcggc gatgcctgtg gttgttttct 



60 
120
180 
240 
300 
360 
420
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 



tgtgcgtact cattcattct aaggggagtc 

aaactcttct gtcactatca gtgtccatct 

atcaggggtc cacacaggct cagctctctc 

agggatggct ctccagtcct tgggtcccgt 

ctatcattct tcgctaattt gtatttgtat 

agagggggaa ggtctgggtc aaattcttct 

gcagcattgt aactgcttaa taaatgacat 

atggagatac ctgaggctat tgctcaagcc 

ggtccctttc agatccagtg gccatgcccc 

tcccagcctc tccactcccc cgccagaaag 

tccctgatta tgatgagctt ccattgttct 
aaagaaactt tttaaaaacg t 



ggtgcaggat 
ctactgtact 
caccacccag 
agcacacagt 
tcccttcccc 
tttgtatgtc 
ctcactgaac 
caggccttgg 
ctgcttccca 
gagcctgagt 
gttaagtctt 



gccgcttctg 
ccctcagcat 
ctcttccctg 
tacagtgtcc 
ctacaagatt 
cagtctcctg 
gaatgagtgc 
acatttagtg 
tggttcactg 
gattctcttt 
gaagaggaat 



ctttggggcc 
cagagcatgc 
accttcacga 
taagatactg 
atgagacccc 
cacagcacct 
tgtgtaagtg 
actgttagcc 
tcattgtgtt 
tcttcttgtt 
ttaataaagc 



2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 



U90547; 

. 70, Last updated, Version 4) 



Human Ro/SSA ribonucleoprotein homolog (RoRet) mRNA, complete cds. 

/translation="MASTTSTKKMMEEATCSICLSLMTNPVSINCGHSYCHLCITDFFK
NPSQKQLRQETFCCPQCRAPFHMDSLRPNKQLGSLIEALKETDQEMSCEEHGEQFHLFC
EDEGQLICWRCERAPQHKGHTTALVEDVCQGYKEKLQKAVTKLKQLEDRCTEQKLSTAM
RITKWKEKVQIQRQKIRSDFKNLQCFLHEEEKSYLWRLEKEEQQTLSRLRDYEAGLGLK
SNELKSHILELEEKCQGSAQKLLQNVNDTLSRSWAVKLETSEAVSLELHTMCNVSKLYF
DVKKMLRSHQVSVTLDPDTAHHELILSEDRRQVTRGYTQENQDTSSRRFTAFPCVLGCE
GFTSGRRYFEVDVGEGTGWDLGVCMENVQRGTGMKQEPQSGFWTLRLCKKKGYVALTSP
PTSLHLHEQPLLVGIFLDYEAGVVSFYNGNTGCHIFTFPKASFSDTLRPYFQVYQYSPL
FLPPPGD"

Sequence 2872 BP; 892 A; 584 C; 688 G; 708 T; 0 other; 
gacccacgcg tccgaaaagc tatggcctca accaccagca ccaagaagat gatggaggaa
gccacctgct ccatctgcct gagcctgatg acgaacccag taagcatcaa ctgtggacac
agctactgcc acttgtgtat aacagacttc tttaaaaacc caagccaaaa gcaactgagg
caggagacat tctgctgtcc ccagtgtcgg gctccatttc atatggatag cctccgaccc 
aacaagcagc tgggaagcct cattgaagcc ctcaaagaga cggatcaaga aatgtcatgt 
gaggaacacg gagagcagtt ccacctgttc tgcgaagacg aggggcagct catctgctgg 
cgctgtgagc gggcaccaca gcacaaaggg cacaccacag ctcttgttga agacgtatgc 
cagggctaca aggaaaagct ccagaaagct gtgacaaaac tgaagcaact tgaagacaga 
tgtacggagc agaagctgtc cacagcaatg cgaataacta aatggaaaga gaaggtacag
attcagagac aaaaaatccg gtctgacttt aagaatctcc agtgtttcct acatgaggaa 
gagaagtctt atctctggag gctggagaaa gaagaacaac agactctgag tagactgagg 
gactatgagg ctggtctggg gctgaagagc aatgaactca agagccacat cctggaactg 
gaggaaaaat gtcagggctc agcccagaaa ttgctgcaga atgtgaatga cactttgagc 
aggagttggg ctgtgaagct ggaaacatca gaggctgtct ccttggaact tcatactatg 
tgcaatgttt ccaagcttta cttcgatgtg aagaaaatgt taaggagtca tcaagttagt 
gtgactctgg atccagatac agctcatcac gaactaattc tctctgagga tcggagacaa 
gtgactcgtg gatacaccca ggagaatcag gacacatctt ccaggagatt tactgccttc 
ccctgtgtct tgggttgtga aggcttcacc tcaggaagac gttactttga agtggatgtt 
ggcgaaggaa ccggatggga tttaggagtt tgtatggaaa atgtgcagag gggcactggc 
atgaagcaag agcctcagtc tggattctgg accctcaggc tgtgcaaaaa gaaaggctat
gtagcactta cttctccccc aacttccctt catctgcatg agcagcccct gcttgtggga 
atttttctgg actatgaggc cggagttgta tccttttata acgggaatac tggctgccac 
atctttactt tcccgaaggc ttccttctct gatactctcc ggccctattt ccaggtttat 
caatattctc ctttgtttct gcctccccca ggtgactaag gaaaagagca gaagctcctt 
ggtttaacca gcacagagaa aataatataa atcccataag ggcagacgtt tggtctgttt 
tcttcgctgt catttcctta gtagttagac tagtgctgag attttagtgg atatataatt 
gatttatgtt gaatatatgg acttagcaac taaaaatacc acagatggtt aacctggact 
ggggcaaagc aagataatag tgatgatcgt atgttgctgt ctccatccgt ctttaatggg 
tcagggcttt gatttccaag ggtcttcagg tgatgagtag gggtacccac aagtcagaag 
gtctgcgttc tcctagtttg tttgctgcca tttgaactca tgtagggaat gaaagaaagc
tgcaattatc cgccaactgc atttaaaaca aaacaaaaca gaaaaatcaa aataacattg 
actcttccaa ccactgacat gttgtttaat aatctaagcg gcagtcctgg aggctaccag 
acttactgag ttctacctga gaaacagcca agcaaagtgt gagagaaggg ttaagactgg 
cttacaatga gatgcttcaa atgaaaaggg aattatgagt aaaattgaac tttgatgggg 
gattcagttc tggaaaagaa tttggtattt tccagtctgc taggaccaat taccttgaaa 
tattttaaaa tctcagtaaa tagttattgc tgaaatggct gttggcagtt cttattatga 
ttcagagaag agcaaataga ccttaacttc attttgaaaa agaccaaatt accatacccg 
agtgagtaat gacaggacta caactaaaac ataaacaaca ttaatgatga ccataaaaag 
tcacaaaatt gctaaatgtt ataatttaga gttgacataa aaattgatgg ccaggcatgg 
tggctcacgc ctgtaatccc agaactatgt gaggctgagg caggtggatc acttgaggtc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



// 



aggagttcaa caccagcctg gccaacatgg tgaaaccctg tctctactaa aaatacaaaa 2460 

attagccggg catggtggta ggggcctgta acccagctac tcgtgaggcc aaggcaggag 2520

aattgcttga gcctgcagca gctgcagtaa gccaagatca tgctgtgcct caaggaSaa till 
aaaaattaat gtttactgat atttgttgaa gtcctacaac atcacc?ctg agalLggag 

aaatgaagca acagttgtgt ctagatgtca gaggcatggc tgggcctcca tltctgfltl 2700 

JLllllTtll aaactattgc ccatgttccc cagggtcaga agttctaatt 2760 

lllV^llll lllltl^l Staagtagta agtgaagggt agcagaatat gccatctttg 2820 

gcataagaag tattttgagt tgaagacaat tgagaaaaaa aaaaaaaaaa aa 2872 



Homo sapiens cDNA FLJ10465 fis, clone NT2RP1001616 . 

actctgctgc cggcttctcg gagcggcgct gggcgaccag agcagggtcg agatgtccta 60 
catcccgggc cagccggtca ccgccgtggt gcaaagagtt gaaattcaca agctgcgtca 120 
aggtgagaac ttaatcctgg gtttcagcat tggaggtgga atcgaccagg acccttccca 180

gaatcccttc tctgaagaca agacggacaa ggtgaggggg tctggggtcc tgggaccgct 240 
ccatggggca caggggcctg agatggtggg tctctgcttc ctgggcctgc atggaaggaa 
cagacttcat ctctcaaacc atgctctcta agaaggcatc ggaagtgacc tagtgagaat 
aaggacgggt ggggtgagga agggctgctc agacagagcc caggaggagc aggaggcggc 
catcagcagg gccggtgcat ggtggtgcag caactctgcc ccggctctct cagaacagtc 
ctcactgacc atatgtgctg ggagaggctg ggtgcaggga cagagggacg gctgagaatg 
tgccatgctg gcttccgctg tgtgataagg ggccagtcca gtgaccacag ggcttgactt 
gggctgcccc tttccagggt atttatgtca cacgggtgtc tgaaggaggc cctgctgaaa 
tcgctgggct gcagattgga gacaagatca tgcaggtaac aggtgtccca aaggaggaga 
aataaggttt gggcaagcag gtctgaagca cttgggggtg gggagctgcc ccagcctccc 
agctgggaga gactcactgc agccaattgg gaacccatac tggcattgcc ccagaggacg 
ctggctttct ctcctgtgtg tctcagccac agtgcttggt gtctcccagc cctgggatgt 
taggctgggg gcctacttga atgacctggg cccccaaagc cctctgcttc cagatcccag 
agggcgggga gctgaggtga gcctgtgttc tctcctgggg ccaggtgaac ggctgggaca 
tgaccatggt cacacacgac caggcccgca agcggctcac caagcgctcg gaggaggtgg 
tgcgtctgct ggtgacgcgg cagtcgctgc agaaggccgt gcagcagtcc atgctgtcct 
agcagccacc accatctgcg actcctgcct gccgcctctc tgtacagtaa cgccacttcc 
acactctgtc cccatctggc ttctgctgac cgctgggccc cagctcagaa gggctatagc 
tggtcccaga ggcctggcct ggccttcctt cccttctccc atccctggcc tggggcctct 
gggaccagct ttctctcctg gacaccgagg attggaaata agggcctgga gctgagtagt 
agccagtctg ctgtgaccac aggctcaggt ccgaccctgc tgcttggcca cagcagtggc 
tgggcaagtg ggaaccacta tctcttggga gcccccaaaa gctgggaaat gctggaggaa 
ccaggccttt cccgcttttg cctggctgca gggttcggct ccgcccctgc cccccagccc 
tcgtgtgtcc acaccgcagt gcctctgccc ctcgggggac tggacacaca tcctgccaga 
ggcgctacga agctttgccc agatgaagcc aggtgggctc cgcgttcact cccactctcc 
cgaggggtgc tggcctcccc agggtttgcc ttcttacgga tttagacgag gttcgaggct 
cacctatcag ggcagctctc aggattgtca ttttcctctt tgcctgtggg tttaactttt 
gtattttttt aatcacaagt ttgatacaaa atgtttttat cgt 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1843 



histone 2, H2aa, mRNA (cDNA clone MGC:2238 IMAGE:3536984), complete cds

/translation="MSGRGKQGGKARAKAKSRSSRAGLQFPVGRVHRLLRKGNYAERVG
AGAPVYMAAVLEYLTAEILELAGNAARDNKKTRIIPRHLQLAIRNDEELNKLLGKVTIA
QGGVLPNIQAVLLPKKTESHHKAKGK"



Sequence 567 BP; 136 A; 171 C; 168 G; 92 T; 0 other; 

ccaggcagga gtttctctcg gtgactacta tcgctgtcat gtctggtcgt ggcaagcaag 60 

gaggcaaggc ccgcgccaag gccaagtcgc gctcgtcccg cgctggcclt lagttcccgg 120 

25TT 9CatCgCtt 9 «=tgcgcaaag gcaactacgc ggagcgagtg ggigccgglg HI 

afaacaS* ° at " Ct9C9 3tcctcgagt atctgaccgc cgagailcctg glgltggcgg III 

foSSS, tC99gacaac aa 9aagacgc gcatcatccc tcgtcacctc cagctggcfa 300 

tccgcaacga cgaggaactg aacaagctgc tgggcaaagt caccatcgcc cagggcggcg 360 

tcttgcctaa catccaggcc gtactgctcc ctaagaagac ggagagtLc caSlgglaa III 

2SS* 9 a93Ct9aC9t «39°ccaag tgggcccagc ccggcccgcg tctcglaggg" 48? 

9 9 * actcaaaa9 9 ctcttttcag agccacccac gttttcaL? aaaagag??! til 

ttaatgctga aaaaaaaaaa aaaaaaa y y y D4tu 



Homo sapiens transcription factor ISGF-3 mRNA, complete cds. 

transcription factor. 

/translation="MSQWYELQQLDSKFLEQVHQLYDDSFPMEIRQYLAQWLEKQDWEH
AANDVSFATIRFHDLLSQLDDQYSRFSLENNFLLQHNIRKSKRNLQDNFQEDPIQMSMI
IYSCLKEERKILENAQRFNQAQSGNIQSTVMLDKQKELDSKVRNVKDKVMCIEHEIKSL
EDLQDEYDFKCKTLQNREHETNGVAKSDQKQEQLLLKKMYLMLDNKRKEVVHKIIELLN
VTELTQNALINDELVEWKRRQQSACIGGPPNACLDQLQNWFTIVAESLQQVRQQLKKLE
ELEQKYTYEHDPITKNKQVLWDRTFSLFQQLIQSSFVVERQPCMPTHPQRPLVLKTGVQ
FTVKLRLLVKLQELNYNLKVKVLFDKDVNERNTVKGFRKFNILGTHTKVMNMEESTNGS
LAAEFRHLQLKEQKNAGTRTNEGPLIVTEELHSLSFETQLCQPGLVIDLETTSLPVVVI
SNVSQLPSGWASILWYNMLVAEPRNLSFFLTPPCARWAQLSEVLSWQFSSVTKRGLNVD
QLNMLGEKLLGPNASPDGLIPWTRFCKENINDKNFPFWLWIESILELIKKHLLPLWNDG
CIMGFISKERERALLKDQQPGTFLLRFSESSREGAITFTWVERSQNGGEPDFHAVEPYT
KKELSAVTFPDIIRNYKVMAAENIPENPLKYLYPNIDKDHAFGKYYSRPKEAPEPMELD
GPKGTGYIKTELISVSEVHPSRLQTTDNLLPMSPEEFDEVSRIVGSVEFDSMMNTV"

Sequence 4003 BP; 1173 A; 812 C; 883 G; 1135 T; 0 other; 

attaaacctc tcgccgagcc cctccgcaga ctctgcgccg gaaagtttca tttgctgtat 60 

gccatcctcg agagctgtct aggttaacgt tcgcactctg tgtatataac ctcgacagtc 120 

ttggcaccta acgtgctgtg cgtagctgct cctttggttg aatccccagg cccttgttgg 180 

ggcacaaggt ggcaggatgt ctcagtggta cgaacttcag cagcttgact caaaattcct 240 

ggagcaggtt caccagcttt atgatgacag ttttcccatg gaaatcagac agtacctggc 300 

acagtggtta gaaaagcaag actgggagca cgctgccaat gatgtttcat ttgccaccat 360 

ccgttttcat gacctcctgt cacagctgga tgatcaatat agtcgctttt ctttggagaa 420 

taacttcttg ctacagcata acataaggaa aagcaagcgt aatcttcagg ataattttca 480

ggaagaccca atccagatgt ctatgatcat ttacagctgt ctgaaggaag aaaggaaaat 540 

tctggaaaac gcccagagat ttaatcaggc tcagtcgggg aatattcaga gcacagtgat 600 

gttagacaaa cagaaagagc ttgacagtaa agtcagaaat gtgaaggaca aggttatgtg 660 

tatagagcat gaaatcaaga gcctggaaga tttacaagat gaatatgact tcaaatgcaa 720 

aaccttgcag aacagagaac acgagaccaa tggtgtggca aagagtgatc agaaacaaga 780 

acagctgtta ctcaagaaga tgtatttaat gcttgacaat aagagaaagg aagtagttca 840 

caaaataata gagttgctga atgtcactga acttacccag aatgccctga ttaatgatga 900 

actagtggag tggaagcgga gacagcagag cgcctgtatt ggggggccgc ccaatgcttg 960 

cttggatcag ctgcagaact ggttcactat agttgcggag agtctgcagc aagttcggca 1020 

gcagcttaaa aagttggagg aattggaaca gaaatacacc tacgaacatg accctatcac 1080 

aaaaaacaaa caagtgttat gggaccgcac cttcagtctt ttccagcagc tcattcagag 1140 

ctcgtttgtg gtggaaagac agccctgcat gccaacgcac cctcagaggc cgctggtctt 1200 

gaagacaggg gtccagttca ctgtgaagtt gagactgttg gtgaaattgc aagagctgaa 1260 

ttataatttg aaagtcaaag tcttatttga taaagatgtg aatgagagaa atacagtaaa 1320 

aggatttagg aagttcaaca ttttgggcac gcacacaaaa gtgatgaaca tggaggagtc 1380

caccaatggc agtctggcgg ctgaatttcg gcacctgcaa ttgaaagaac agaaaaatgc 1440

tggcaccaga acgaatgagg gtcctctcat cgttactgaa gagcttcact cccttagttt 1500 

tgaaacccaa ttgtgccagc ctggtttggt aattgacctc gagacgacct ctctgcccgt 1560 

tgtggtgatc tccaacgtca gccagctccc gagcggttgg gcctccatcc tttggtacaa 1620 

catgctggtg gcggaaccca ggaatctgtc cttcttcctg actccaccat gtgcacgatg 1680 

ggctcagctt tcagaagtgc tgagttggca gttttcttct gtcaccaaaa gaggtctcaa 1740 

tgtggaccag ctgaacatgt tgggagagaa gcttcttggt cctaacgcca gccccgatgg 1800 

tctcattccg tggacgaggt tttgtaagga aaatataaat gataaaaatt ttcccttctg 1860 

gctttggatt gaaagcatcc tagaactcat taaaaaacac ctgctccctc tctggaatga 1920 

tgggtgcatc atgggcttca tcagcaagga gcgagagcgt gccctgttga aggaccagca 1980 

gccggggacc ttcctgctgc ggttcagtga gagctcccgg gaaggggcca tcacattcac 2040 

atgggtggag cggtcccaga acggaggcga acctgacttc catgcggttg aaccctacac 2100 

gaagaaagaa ctttctgctg ttactttccc tgacatcatt cgcaattaca aagtcatggc 2160 

tgctgagaat attcctgaga atcccctgaa gtatctgtat ccaaatattg acaaagacca 2220 

tgcctttgga aagtattact ccaggccaaa ggaagcacca gagccaatgg aacttgatgg 2280 



ccctaaagga 
tagacttcag 
tcggatagtg 
ttttcatctt 
tccttcacat 
gttgatagca 
gcatcttact 
aacatccaga 
ttaagcaaca 
gataaatcag 
aattcttaca 
gaaatagttc 
catggatctg 
gacaaagaca 
aatgacacta 
atatccaaag 
ctcccaaggg 
gtagataaac 
attagggtgg 
ctgacaactt 
tcctatgtaa 
gcgaatggtt 
ttcgtgaagt 
ttaagagatg 
cttgtttttc 
ggtggagata 
attaaatata 
tctcagtttt 
ctaaaaaaca 



actggatata 
accacagaca 
ggctctgtag 
ctctggcgac 
cctgtgtttc 
agtgaatttt 
gaaggtaaaa 
tacacccaaa 
tctagcaaat 
tggttattta 
tgttttcttt 
aaagccaagt 
gtagggggaa 
aattctgttt 
gctaatatca 
ctgaatacat 
agttaggcta 
tcagaaattt 
tatttagtct 
gaataataca 
ctgcattgag 
ccattctctc 
atactgtatt 
ggtttgacaa 
cactactgct 
aagatttctt 
atatcgacac 
tatatagatg 
aagaagacaa 



tcaagactga 
acctgctccc 
aattcgacag 
agttttcctt 
tagggaaatg 
tctctaactc 
ttgaaaggca 
gtatcaggac 
gttatgcata 
gggaactgct 
gctttaagtg 
ttatatacaa 
aatgtgtatt 
cttgagaaga 
atagaaggat 
tctgctttca 
ttcacaacca 
aattcatgtt 
attagccaca 
ccagagataa 
aactgcatat 
tcctgtactt 
tttacctttt 
ggttcttccc 
accacaacta 
gagttttgtt 
agtgctttcc 
gcgagaacct 
cattaaaaac 



gttgatttct 
catgtctcct 
tatgatgaac 
ctcatctgtg 
aaagaaaggc 
agaaacatca 
ttctctgaag 
gagaatgagg 
aagtcagtgc 
tgacgtagga 
taactggcag 
ttatatcagt 
ttattacatc 
gaatattagc 
gtacatttcc 
tcttggtcac 
ctcattcaaa 
tcttaaatgg 
aaattgggaa 
tatgagaatc 
gtttcgctga 
tttccagaca 
tccttcctta 
ttttacatac 
tattatcatg 
ttaaaattaa 
gtggcactgc 
aagtttcagt 
aatattgttt 



gtgtctgaag 
gaggagtttg 
acagtataga 
attccctcct 
cagcaaattc 
gttactctga 
agtgggtttc 
gtcctttggg 
ccaactgtta 
acggtaaatt 
ttttccattg 
cctctttcaa 
tttcacattg 
tttactgttt 
aaattcacaa 
atacaattat 
agttgaaatt 
gctactttgt 
aggagtagaa 
agatcatttc 
tatatgtgtt 
cttttttgag 
tcactgacac 
tgctgtctat 
caaatgctgt
agctaaagta 
atacaatctg 
tgattttaca 
cta



ttcacccttc 
acgaggtgtc 
gcatgaattt 
gctactctgt 
gctgcaacct 
agggcatcat 
acaagtgaaa 
aaaggagaag 
taggttgttg 
tctgtgggag 
gtttacctgt 
aggtagccat
gctatttaaa 
gttatggctt 
gttgtgtttg 
ttttacagtt 
aaccatagat 
cctttttgtt 
aaagcagtaa 
aaaactcatt 
tttcacattt 
tggatgatgt 
aaaaagtaga 
gtggctgtat 
attcttcttt 
tctgtattgc 
aggcctcctc 
attgaaatga 



2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4003 



Homo sapiens mRNA; cDNA DKFZp564K2478 (from clone DKFZp564K2478); complete

/translation="MSKAFGLLRQICQSILAESSQSPADLEEKKEEDSNMKREQPRERP
RAWDYPHGLVGLHNIGQTCCLNSLIQVFVMNVDFTRILKRITVPRGADEQRRSVPFQML
LLLEKMQDSRQKAVRPLELAYCLQKCNVPLFVQHDAAQLYLKLWNLIKDQITDVHLVER
LQALYTIRVKDSLICVDCAMESSRNSSMLTLPLSLFDVDSKPLKTLEDALHCFFQPREL
SSKSKCFCENCGKKTRGKQVLKLTHLPQTLTIHLMRFSIRNSQTRKICHSLYFPQSLDF
SQILPMKRESCDAEEQSGGQYELFAVIAHVGMADSGHYCVYIRNAVDGKWFCFNDSNIC
LVSWEDIQCTYGNPNYHWQETAYLLVYMKMEC"

Sequence 1874 BP; 481 A; 436 C; 489 G; 468 T; 0 other; 
agtcccgacg tggaactcag cagcggaggc tggacgcttg catggcgctt gagagattcc 
atcgtgcctg gctcacataa gcgcttcctg gaagtgaagt cgtgctgtcc tgaacgcggg 
ccaggcagct gcggcctggg ggttttggag tgatcacgaa tgagcaaggc gtttgggctc 
ctgaggcaaa tctgtcagtc catcctggct gagtcctcgc agtccccggc agatcttgaa 
gaaaagaagg aagaagacag caacatgaag agagagcagc ccagagagcg tcccagggcc 
tgggactacc ctcatggcct ggttggttta cacaacattg gacagacctg ctgccttaac 
tccttgattc aggtgttcgt aatgaatgtg gacttcacca ggatattgaa gaggatcacg 
gtgcccaggg gagctgacga gcagaggaga agcgtccctt tccagatgct tctgctgctg 
gagaagatgc aggacagccg gcagaaagca gtgcggcccc tggagctggc ctactgcctg 
cagaagtgca acgtgccctt gtttgtccaa catgatgctg cccaactgta cctcaaactc 
tggaacctga ttaaggacca gatcactgat gtgcacttgg tggagagact gcaggccctg 
tatacgatcc gggtgaagga ctccttgatt tgcgttgact gtgccatgga gagtagcaga 
aacagcagca tgctcaccct cccactttct ctttttgatg tggactcaaa gcccctgaag 
acactggagg acgccctgca ctgcttcttc cagcccaggg agttatcaag caaaagcaag 
tgcttctgtg agaactgtgg gaagaagacc cgtgggaaac aggtcttgaa gctgacccat 
ttgccccaga ccctgacaat ccacctcatg cgattctcca tcaggaattc acagacgaga 
aagatctgcc actccctgta cttcccccag agcttggatt tcagccagat ccttccaatg 
aagcgagagt cttgtgatgc tgaggagcag tctggagggc agtatgagct ttttgctgtg 
attgcgcacg tgggaatggc agactccggt cattactgtg tctacatccg gaatgctgtg 
gatggaaaat ggttctgctt caatgactcc aatatttgct tggtgtcctg ggaagacatc 
cagtgtacct acggaaatcc taactaccac tggcaggaaa ctgcatatct tctggtttac 
atgaagatgg agtgctaatg gaaatgccca aaaccttcag agattgacac gctgtcattt 
tccatttccg ttcctggatc tacggagtct tctaagagat tttgcaatga ggagaagcat 
tgttttcaaa ctatataact gagccttatt tataattagg gatattatca aaatatgtaa 
ccatgaggcc cctcaggtcc tgatcagtca gaatggatgc tttcaccagc agacccggcc 
atgtggctgc tcggtcctgg gtgctcgctg ctgtgcaaga cattagccct ttagttatga 
gcctgtggga acttcagggg ttcccagtgg ggagagcagt ggcagtggga ggcatctggg 
ggccaaaggt cagtggcagg gggtatttca gtattataca actgctgtga ccagacttgt 
atactggctg aatatcagtg ctgtttgtaa tttttcactt tgagaaccaa cattaattcc 
atatgaatca agtgttttgt aactgctatt catttattca gcaaatattt attgatcatc 
tcttctccat aagatagtgt gataaacaca gtcatgaata aagttatttt ccacaaaaaa 
aaaaaaaaaa aaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1874 



Homo sapiens cDNA FLJ20073 fis, clone COL02320. 

/translation="MGTYSTILIKTEVIECGNYCGVRIIHSLIAEFSLEELKKSYHLNK

NEAVEAVLLESIHRFNPNAFICQALARHFYIKKKDFGNALNWAKQAKIIEPDNSYISDT
LGQVYKSKIRWWIEENGGNGNISVDDLIALLDLAEHASSAFKESQQQSEDREYEVKERL
^^^ ra ^^ GYQGEI ^ GLYTIQILQLIPFFDNKNE LSKRyMVNFVSGSSDIPG 
DPI^KI^KNYIPYLTKLKFSLKKSFDPFDEYFVLLKPP^IKQNEEAKTRRKVAGY 
FKKYVDIFCLLEESQNNTGLGSKFSEPLQVERCPJ^VALKADKFSGLLEYLIKSQEDA 
^™^ CIWE ^^ LEQC: ^Q S ^^FIIANIILSCIQPTSRLVKPVEKLKDQLRE 
^QPIGLTYQFSEPYFIASLLFWPENQQLDQHSEQMKEYAQALKNSFKGQYKHMHRTKQ 
^fll LGKGKR1 ^^ G ^DQCPKKTPDINSLWQSGDVWKEEKVQELLLRLQGRAE 
NNCLYIEYGINEKITIPITPAFLGQLRSGRSIEKVSFYLGFPIGGPLAYDIEIV" 



Sequence 3401 BP; 1260 A; 588 C; 619 G; 934 T; 0 other; 
aaaatttgaa gacaagatgg gcacctactc tacaattctg ataaaaacag aggtcatcaa 60
SSoaao r Ct9t ? 9 ^ 9 taC * Catcat tcactctttg attgcagag? ScactgS 120
a 9aa ^f aa ^ aaaa3 ^ tatc acctgaataa aagtcaaatt atgttggata tgctaacfja 180
9aa "! 9ttC ttc 9atactg gtatgggaaa aagtaaattt ttgcaagata tgcacacact 240
tat a a« a o a a9acacc 9 c 9 atgaacatga aggtgaaaca ggaaattggt tttccccatt 300
tatSt 9 ^ " aCataaa 9 atgaaggaaa tgaagcagtt gaagctgtat tgcttgaaag 360
tatccatcgg ttcaacccaa atgcattcat ttgccaagcg ttggcaagac atttctacat 420
t a ^ aa9aa9 TFVF 0 * at 9 ctctaaa ctgggcaaaa caagcaalaa tcatagaacc 480
tgacaattct tatatctcag atacactggg tcaagtctac aaaagtaaaa taagatggtg 540
gatagaggaa aacggaggaa acgggaacat ttcagttgat gatctaattg ctctttSgga 600
tttagcagaa catgcctcaa gtgcattcaa agaatctcaa cagcaaagtg aagatagaga 660
ESS? aa93aaa 9 at tgtatccgaa gtcaaaaagg cggtatgatJ ctLcaatlt 720
SS 99 " 3 ! caa 99 a gaga tagaagttgg gctttacaca atccaaattc tccagctcat 780
aaaSS^J 9ataataaaa atgagctatc taaaagatat atggtcaatt ttgtatcagg 840
! a9 ! a9 ! 9at at ^ cca 9999 atccaaacaa tgaatataaa ttagccctca aaaactatat 900
tccttattta actaaattga aattttcttt gaaaaagtcc tttgattttt ttgatgaata 960
ctttgtcctg ctaaaaccca ggaacaatat taagcaaaat gaagaggcca aaactcggag 1020
aaaggtggct ggatatttta agaaatatgt agatatattt tgtltlltag aagaatcaca 1080
aaaoaacaca ggtcttggat caaagttcag tgagccactt caagtagagl gafgcaggag 1140
tcaaaaaoat 9C "* aaaa9 <=agacaagtt ttctgggctc ttggaaLfc ctafcaaaa? 1200
tcaagaggat gctataagca ctatgaaatg tatagtgaac gaatatactt ttctcttaga 1260
^ aa ^ 9CaC ^ 9tcaaaatcc agtcaaaaga aaagctaaat ttcatcttgg ccaacattat 1320
tctctcctgt atccaaccta cctccagatt agtaaagcca gttgaaaaac taaaagatca 1380
S^ 9a9aa ftcttgcaac caataggact gacttatcag ttttcagaac cgtattttct 1440
IZlltlZ t ttattctggc cagaaaatca acaactagat caacat?ctg alcaaatgaa 1500
S!£ 9 c t agcactaa a aaattcttt caaggggcaa tataaacata tgcatcgtac 1560
aaagcaacca attgcatatt tctttcttgg aaaaggtaaa agactggaaa gacttgttca 1620
taaaal aaaa '"W* 9ctttaagaa gacaccagat a?taa? 9 C ct: JgtggSgaJ 1680
tggagatgtg tggaaggagg aaaaagtcca agaacttttg cttcgtttac aaggtcgagc 1740
tcccacS^ J? tttatata tagaatatgg aatcaatgal aaaa?cacaa tacStcac 1800
HaJlttl "aggtcaac ttagaagtgg cagaagcata gagaaggtgt ctttttacct 1860
999a "! CCC atfc 99 a 9gcc cacttgctta tgacattgaa attgtttaag agcctgatat 1920
tcttcctcca agaatttgat ctcagtaccc atttaatttt tttggactca agatctatgc 1980
tttaaaccgg caaggttata gatacagcct ctagctcttc agatctgtac atgcagtatt 2040
tlT.Tt ° ttaaacat 9t tatgagttct acaaggacaa tagtgaaaaa ggLggagtg 2100
agatatatga aaagtagcaa atatgttcct tggtttggtt aacatcattg atgacaaaa? 2160
aa ! aa ! 9a !° tat 9 act 9g a gtcaggagaa gttagtgtaa taagctggct acacagaacc 2220
ccactactta ccaggcatgg attgaagaag attgtctact caaatggcat ttagacatta 2280
gaatgtctgg gaaaatattt ctcaaagaca gcaaaaacct ctcaaactga ggagcaacat 2340
ttattcttac taagcagatc atcaatgtat catgtgcttg gcactcaagg atcttccaaa 2400
acagaggacc aaccagtctt ctgaaggtca tgcccacaga agtcatcgga ccttaccaaa 2460
gtaggttgga gaattagatt gccttttcat gcagtgagat tcagttaagc aaaaatgaaa 2520
tttgtctcta tagctaatta gcttatcaac tcccctccaa acaaacaatt aaaaaaaaaa 2580
catacagaca ctcaaattcc acaagctaat gaacaaaagg gactcttgtg agaagactaa 2640
tgagtccctc atccagaaga tgccaatgta ctggcagatt aacatacaac ctatgttttg 2700
aacaaaaaca accagcgata cgtaatcaaa atgtaatttt cccctaataa aattatggat 2760
atgggcagtc atcaatggct gccaaaacca ttaagtggaa agctgattaa aaaacaaaaa 2820
tttctaatgg atttatcaaa ctgtcccaaa tcctgataaa tattaacatc acagaggaag 2880
accagacatt atgggcctgg aagtactata ggagtgcaca catcacccgt gacatggtct 2940
tgccaaataa ttaaacctga atttgatcag gtctctggat cttatttgca attcaaaaga 3000 
aattttaaaa aaatcctact aacaccacca caaatatgca atcagcaata tccagaaagg 3060 
ggaaattcac aggacaaaaa cctggttttc ttttttggtt tcttcaacca aaaaagaaag 3120 
aaattgcaaa ggaccaaaaa aatgttgggg aatctataca ttataaggga cttaacaact 3180 
aaagggcaac atatagactt tagatcctaa tttgagcaaa atctaaaatc aattattagg 3240 
caatcagaaa aatttgaaca cagactagat atttgaggat attaaggtac tatattattg 3300 
aagattccat ggttatgttt tttaaagagt tcatgccttt tagagataca tactaaagta 3360
tttgtaaata aatgacatga tctagaaaaa aaaaaaaaaa a 3401



Homo sapiens cDNA FLJ10913 fis, clone OVARC1000209, weakly similar to Oryza
sativa submergence induced protein 2A mRNA.

/translation="MVLAWYMDDAPGDPRQPHRPDPGRPVGLEQLRRLGVLYWKLDADK
YENDPELEKIRRERNYSWMDIITICKDKLPNYEEKIKMFYEEHLHLDDEIRYILDGSGY
FDVRDKEDQWIRIFMEKGDMVTLPAGIYHRFTVDEKNYTKAMRLFVGEPVWTAYNRPAD
HFEARGQYVKFLAQTA"

Sequence 1628 BP; 440 A; 349 C; 389 G; 450 T; 0 other;

gagcgcggcc cctgggttcg aacacggcac ccgcactgcg cgtcatggtg ctggcctggt 60 

atatggacga cgccccgggc gacccgcggc aaccccaccg ccccgacccc ggccgcccag 120 

tgggcctgga gcagctgcgg cggctcgggg tgctctactg gaagctggat gctgacaaat 180 

atgagaatga tccagaatta gaaaagatcc gaagagagag gaactactcc tggatggaca 240 

tcataaccat atgcaaagat aaactaccaa attatgaaga aaagattaag atgttctacg 300

aggagcattt gcacttggac gatgagatcc gctacatcct ggatggcagt gggtacttcg 360 

acgtgaggga caaggaggac cagtggatcc ggatcttcat ggagaaggga gacatggtga 420 

cgctccccgc ggggatctat caccgcttca cggtggacga gaagaactac acgaaggcca 480

tgcggctgtt tgtgggagaa ccggtgtgga cagcgtacaa ccggcccgct gaccattttg 540 

aagcccgcgg gcagtacgtg aaatttctgg cacagaccgc ctagcagtgc tgcctgggaa 600 

ctaacacgtg cctcgtaaag gtccccaatg taatgactga gcagaaaatc aatcactttc 660 

tctttgcttt tagaggatag ccttgaggct agattatctt tcctttgtaa gattatttga 720 

tcagaatatt ttgtaatgaa aggatctaga aagcaacttg gaagtgtaaa gagtcacctt 780 

cattttctgt aactcaatca agactggtgg gtccatggcc ctgtgttagt tcatgcattc 840 

agttgagtcc caaatgaaag tttcatctcc cgaaatgcag ttccttagat gcccatctgg 900 

acgtgatgcc gcgcctgccg tgtaagaagg tgcaatccta gataacacag ctagccagat 960 

agaagacact tttttctcca aaatgatgcc ttggggtggg gagtggtagt gggaagagct 1020 

cccaccctaa ggggcacaca ctgagttgct tatgccactt ccttgttcaa aataaagtaa 1080 

ctgccttaat cttatactca tggcttggag ttaccttata ttcaggtata tgtgatattt 1140 

tgcctggttt gttaaaattg ccccatttag attccttcta taattgttct tatagataag 1200 

taatttatat atgagctgtg ttagtatttt tttcagtgtg agatctctgg attctttcac 1260 

aataaagctg ttgaatttta acaggagtat tagtacataa attttctact caacaattcc 1320 

gagataggat tatgcctagt ttgtcatatc acagaaaaac tccaagttaa cttcatgttt 1380 

tggaagggca ggtcgttttt aaagtatttc tttttttaac tggatgaaaa atcttcatgt 1440 

taggattaat tttcttaatc acctccacac tgtacagagg aaactcaagc cttaaatgtt 1500 

taagtaaact ctgtctcagt tttaggatta aaatacccac cggtggtgtg atgatgccat 1560 

ataccgcagg gcttgcttct gtcaagtgtg actctatctc agtaattaaa ataagtgctg 1620 

atctactg 1628



Homo sapiens cDNA: FLJ22242 fis, clone HRC02528. 

/translation="MALGLCRCFHPRHSMAAFGLFPALPSALNSHPACTCLLDPSTWRP
AHVSGPALASSPQILSVFSLGFPGFVNGSCVSRYKPDIIFPPGLPPPDLPSSVSIFCLQ
LLCSHGHCCITESGPLLSFSNWPPSLVPHFLKSPVHCHQIKLSPARSPLSEKPPLTWKH
HCLAHILTYPPSRLDPHTSFQPPLPLHSLLSPPPPHPLVSPPL" 

Sequence 1300 BP; 268 A; 413 C; 227 G; 392 T; 0 other; 
aacttttaaa aactctcatt ggagtaagtc ttttcaagat gatcctccac aatggaggca 
gcgttcctac ttgtcatcac acagctgaag acattgtttc ttaggtgtga aatcggggac 
aaaggacaaa cagagacaca cggcattgtt catgggaggc atcgtcaccc tcctgggtgt 
tctgtgggaa tttcctgtgt gaggaaaacg tggccacagg gttgtgctgt acccaccctt 
ccccggcgag atggccctcg gcctgtgccg ctgcttccac cctcgccact ccatggcagc 
ttttggtctg tttccggctc tgccctctgc cctgaactct catccggctt gtacctgcct 
gctggacccc tccacctgga ggccagccca tgtctcaggc ccagccctag cctcttctcc 
?caaattcta agtgttttct ctttaggttt ccctggcttt gtgaatggat catgtgtctc 
taggtataaa cctgacatca tctttccacc cggcttacct ccaccagatc tccccagttc 
tgtctccatc ttctgcctgc agctgctctg ttctcatggt cactgctgca tcactgagtc 
tggacccttg ttatcatttt caaactggcc tccttccctc gttccccact tcttaaagtc 
acctgtccat tgccaccaga ttaagctttc tccagccaga tcacctctct ctgagaaacc 
tccattgaca tggaaacacc attgtctggc acacatactc acatacccac cttcccgtct 
tgatccccac acatctttcc agcctcccct cccactccac tccctgctct ctcctccacc 
tccccatcct cttgtctccc ctcccctctg aatccagccc agcggggctt ctcctgcctc 
catcacatca cagaagtacc tcctgcttct ggttttaatt agagccttcc ccgattacat 
tttcctctga attttttcct atctacattt gatctgtcat gtttaaaccc cctacttcta 
agggaacttc tctaatctct tatcctcatc cccaaatagt gttttcttcc tctgggttct 
tataatgttg gtatcaatct cacagcattt agtgcttcct gcctggtgtg acagttacct 
gtgtgcatgt gcaatttcta atttcccacg ctagactgtg agcttcctaa ggcaagaatc 
atgcctcgtt ggtttctgta ttcctcatgg tgccaaacac agtgccttct acattgcagg 
cgctgaataa acatttttaa agcaaaaaaa aaaaaaaaaa 
ta77f02.x2 NCI_CGAP_HSC2 Homo sapiens cDNA clone IMAGE:2050107 3' similar
to gb:L19779 HISTONE H2A.1 (HUMAN);, mRNA sequence.

tatacggctg cgagaagacg acagaagggg cacctgtgaa ctcaaaaggc tcttttcaga 60
gccacccacg ttttcaaata aaagagttgt taatgctggc cactcccaaa aaaaaaaaaa 120 
aaaaaaaaaa agtcgtatcg a 141 



H. sapiens centromere autoantigen C (CENPC) mRNA, complete cds. 

/translation="MAASGLDHLKNGYRRRFCRPSRARDINTEQGQNVLEILQDCFEEK

SIjANDFSTNSTKSVPNSTRKIKDTCIQSPSKECQKSHPKSVPVSSKKKEASLQFVVEPS 

EATNRSVQAHEVHQKIIiATDVSSKNTPDSKKI S SRNINDHHSEADEEFYLS VGSPSVLL 

DAKTSVS QNVI PS S AKKRETYTFENS VNMLP S S TEVS VKTKKRLOTDDKVMLKKIE IDN 

KVSDEEDKTSEGQERKPSGSSQNRIRDSEYEIQRQAKKSFSTLFLETVKRKSESSPIVR 

HAATAPPHSCPPDDTKLIEDEFIIDESDQSFASRSWITIPRKAGSLKQRTISPAESTAL 

FQGRKSREKHHNIIjPKTIjANDKHSHKPHPVETSQPSDKTVLDTSYAIjID 

YEMYSKNAEKPSRSKRTIKQKQRRKFMAKPAEEQLDVGQSKDENIHTSHITO 

DRNMEEHEEMGND CVS KKQMPP VGS KKS S TRKDKE E S KKKRF S S E S KNKL VPEEVTS TV 

TKSRRISRRPSDWWV\nCSEESPVYSNSSVRlffil»PMHHNSSRKSTKKTNQSSKNIRKKT 

PIjKRQKTATKGNQRVQKFLNAEGSGGIVGHDEISRCSLSEPLESDEADIjAKKKNIjDCSR 

STRSSKNEDNIMTAQNVPIiKPQTSGYTCNIPTESNLDSGEHKTSVI^ 

MSGKNDVDDEEVHGSSDDSKQSKVIPKl^IHHKLVLPSNTPNVRRT 

ERIDYQGRPSGGFVISG^SPDTISSKRKAIO^ 

NIjGI PLGDPIiQPTRVKDPETRE I ILMDLVRPQDTYQFFVKHGELKVYKTLDTPFF S TGK 
L ILGPQEEKGKQHVGQD ILVFYVNFGDLLCTLHETPYIIjS TGDS FYVPS GNYYNI KNIiR 
NEES VLIiFTQ I KR " 

Sequence 3132 BP; 1164 A; 542 C; 630 G; 796 T; 0 other; 
cggatcgcag ctctcgcggc agtcgcctga gacttaaggt tattgcttgg ccgcggcctg 
gtattccggc gattcgtttc ttgctcggct tcctggagct gtggtccgtg tgggcttcca 
cctcagacag ttgcgctggc tcagcggggc cggaacatgg ctgcgtccgg tctggatcat 
ctcaaaaatg gctacagaag aagattttgt cgaccttcca gggcacgtga cattaacaca 
gagcaaggcc agaatgttct ggaaatctta caagactgtt ttgaagaaaa aagtcttgcc 
aatgatttta gtacaaattc tacaaaatca gtgcctaatt caacacgcaa aataaaagac 
acttgtattc agtcaccaag caaagagtgc cagaaatcac atccaaagtc agttccagtt 
tcttcaaaga agaaagaagc ctctctacag tttgttgtag aaccaagtga agccacaaac 
agatcagttc aggcccatga agttcatcag aaaattctgg caactgatgt tagttccaaa 
aatacacctg actcgaaaaa aatatcaagt agaaacataa atgatcatca cagtgaagct 
gatgaagaat tttacttatc cgttggctca ccttctgttc ttttggatgc aaaaacatct 
gtatcacaaa atgttattcc atctagtgcc aaaaagagag agacttacac ttttgaaaat 
tcagtaaata tgctgccttc aagtacagag gtttcagtta aaaccaaaaa aaggttaaac 
tttgatgata aagttatgtt aaagaaaata gaaatagata ataaagtatc agatgaagag 
gataaaacat cggaaggaca agaaagaaaa ccatcaggat catctcagaa tagaatacga 
gattcagaat atgaaattca acgacaagct aaaaaaagtt tttcaacatt gtttttagaa 
acagtaaaac gaaaaagtga atccagtccc attgttaggc atgcggcaac tgctccacct 
cattcgtgtc ctcccgatga tacgaagttg atagaggatg aatttataat tgatgagtcg 
gatcaaagtt ttgccagtag atcttggatt acaataccaa gaaaggcagg gtctctgaaa 
caacgcacaa tatccccggc tgagagcact gcactctttc aaggtagaaa gtcaagagaa 
aagcatcata atatattacc taagactttg gcaaatgaca aacattccca taaacctcac 
ccagtagaga catctcagcc ctctgataaa acagtactgg atacaagtta tgctttgata 
gatgaaacag taaataatta tagatctaca aaatatgaaa tgtattccaa gaatgcagaa 
aaaccatcta gaagcaaaag gactataaaa caaaaacaga gaagaaaatt catggctaaa 
ccagctgaag aacagcttga tgtgggacag tctaaagatg aaaacataca tacatcacat 
attacccaag acgaatttca aagaaattca gacagaaata tggaagagca tgaagagatg 
ggaaatgatt gtgtttccaa aaaacagatg ccacctgtgg gaagcaagaa aagtagcact 
agaaaagata aggaagaatc taaaaagaag cgcttttcca gtgagtccaa gaacaaactt 
gtacctgaag aagtgacttc aactgtcacg aaaagtcgaa gaatttccag gcgtccatct 
gattggtggg tggtaaaatc agaggagagt cctgtttata gcaattcttc agtaagaaat
gaattaccaa tgcatcacaa tagtagccga aaatctacta agaaaacaaa tcagtcatct 
aagaatatta ggaaaaaaac tattccactt aaaaggcaga agacagcaac taaaggcaac 
caaagagtac agaagttttt aaatgctgaa ggttctggag gtatcgttgg tcatgatgaa 
atttccagat gttcactgag tgagccattg gaaagtgatg aggcagactt ggctaagaag 
aaaaatcttg attgttctag atctacaaga agctcaaaga atgaagataa cattatgact 
gcacagaatg ttcccctaaa gcctcagacc agtggatata catgtaatat accaacagag 2160
tcaaacttgg attctggaga gcataagact tcagttttag aggaaagtgg accttccagg 2220
ctcaataata attatttaat gtctggaaag aatgatgtgg atgatgagga agttcatgga 2280
agttcagatg actcaaaaca atctaaagtg ataccaaaga acagaatcca tcacaaacta 2340
gtattgccct ccaacacacc aaatgttcgc aggaccaaga gaacacgttt gaaacctttg 2400
gagtactggc gaggagagcg aatagattat caaggaaggc catcaggagg attcgtgatt 2460
agtggagtac tatctccaga cacaatatcg tctaaaagga aggcaaaaga aaatattgga 2520
aaagtcaaca aaaaatctaa taagaaaagg atctgtcttg ataacgatga aagaaagact 2580
aacttaatgg taaatctagg tatacctctt ggagatcctt tgcagccaac gagggtaaag 2640
gacccagaaa caagagagat tattctcatg gatcttgtaa ggccacaaga tacatatcaa 2700
ttttttgtta agcatggtga gttgaaggta tacaagacat tggatacacc ctttttttct 2760
actgggaaat tgatattagg accacaagaa gaaaagggaa agcagcatgt tggccaggat 2820
atattggttt tttatgttaa ctttggtgac cttttgtgta ctttacatga aacaccttat 2880
atattaagta ctggggattc gttctatgtt ccttcaggta actattataa catcaaaaat 2940
ctccggaatg aggaaagtgt tcttcttttt actcagataa aaagatgaaa gatcaaccaa 3000
ccttaaatat atgtatgtat atatgtatat gtaaaaacag tttgtatagt tggaatattt 3060
gtctttgtaa ttacttgtga tgttttaaaa taaaaatttt attcagtttt gtgtaaaaaa 3120
aaaaaaaaaa aa 3132



Homo sapiens transcription factor ISGF-3 mRNA, complete cds.

transcription factor. 

/translation="MSQWYELQQLDSKFLEQVHQLYDDSFPMEIRQYLAQWLEKQDWEH
AANDVSFATIRFHDLLSQLDDQYSRFSLENNFLLQHNIRKSKRNLQDNFQEDPIQMSMI
IYSCLKEERKILENAQRFNQAQSGNIQSTVMLDKQKELDSKVRNVKDKVMCIEHEIKSL
EDLQDEYDFKCKTLQNREHETNGVAKSDQKQEQLLLKKMYLMLDNKRKEVVHKIIELLN

VTELTQNALINDELVEWKRRQQSACIGQPPNACIiDQLQNWFTrVAESIiQQWQQLKKLE 
ELEQKYTYEHDP ITKNKQVTjWDRTFSLFQQIjI QS SFWERQPCMPTHPQRPLVLKTGVQ 
FTVKLRLLVKLQELNYNLK^TKVL 

IiAAEFRHLQLKEQKNAGTRTNEGPIiIVTEEI^SLSFETQIiCQPGLVIDLETTSLPVVVI 

SNVSQLPSGWASILWYNI^VAEPRNLSFFLTPPCARWAQLSEVLSWQFSSW 

QLNTOiGEKIiliGPNASPDGLIPWTRFCKENIlSroKN^ 

CIMGFISKERERALLKDQQPGTFLLRFSESSREGAITFTWVERSQNGGEPDFHAVEPYT 
KKELSAVTFPD I IRIT^KVMAAENIPENPLKYLYPNIDKDHAFGKYYSRPKEAPEPMELD 
GPKGTGYIKTEIiISVSEVHPSRIjQTTDl^LPMSPEEFDEVSRIVGSVEFDSMMNTV n 



Sequence 4003 BP; 1173 A; 812 C; 883 G; 1135 T; 0 other; 
attaaacctc tcgccgagcc cctccgcaga ctctgcgccg gaaagtttca tttgctgtat 
gccatcctcg agagctgtct aggttaacgt tcgcactctg tgtatataac ctcgacagtc 
ttggcaccta acgtgctgtg cgtagctgct cctttggttg aatccccagg cccttgttgg 
ggcacaaggt ggcaggatgt ctcagtggta cgaacttcag cagcttgact caaaattcct 
ggagcaggtt caccagcttt atgatgacag ttttcccatg gaaatcagac agtacctggc 
acagtggtta gaaaagcaag actgggagca cgctgccaat gatgtttcat ttgccaccat 
ccgttttcat gacctcctgt cacagctgga tgatcaatat agtcgctttt ctttggagaa 
taacttcttg ctacagcata acataaggaa aagcaagcgt aatcttcagg ataattttca 
ggaagaccca atccagatgt ctatgatcat ttacagctgt ctgaaggaag aaaggaaaat 
tctggaaaac gcccagagat ttaatcaggc tcagtcgggg aatattcaga gcacagtgat 600 
gttagacaaa cagaaagagc ttgacagtaa agtcagaaat gtgaaggaca aggttatgtg 660 
tatagagcat gaaatcaaga gcctggaaga tttacaagat gaatatgact tcaaatgcaa 
aaccttgcag aacagagaac acgagaccaa tggtgtggca aagagtgatc agaaacaaga 
acagctgtta ctcaagaaga tgtatttaat gcttgacaat aagagaaagg aagtagttca 
caaaataata gagttgctga atgtcactga acttacccag aatgccctga ttaatgatga 
actagtggag tggaagcgga gacagcagag cgcctgtatt ggggggccgc ccaatgcttg 
cttggatcag ctgcagaact ggttcactat agttgcggag agtctgcagc aagttcggca 
gcagcttaaa aagttggagg aattggaaca gaaatacacc tacgaacatg accctatcac 
aaaaaacaaa caagtgttat gggaccgcac cttcagtctt ttccagcagc tcattcagag 
ctcgtttgtg gtggaaagac agccctgcat gccaacgcac cctcagaggc cgctggtctt 
gaagacaggg gtccagttca ctgtgaagtt gagactgttg gtgaaattgc aagagctgaa 
ttataatttg aaagtcaaag tcttatttga taaagatgtg aatgagagaa atacagtaaa 
aggatttagg aagttcaaca ttttgggcac gcacacaaaa gtgatgaaca tggaggagtc 
caccaatggc agtctggcgg ctgaatttcg gcacctgcaa ttgaaagaac agaaaaatgc 
tggcaccaga acgaatgagg gtcctctcat cgttactgaa gagcttcact cccttagttt 
tgaaacccaa ttgtgccagc ctggtttggt aattgacctc gagacgacct ctctgcccgt 
tgtggtgatc tccaacgtca gccagctccc gagcggttgg gcctccatcc tttggtacaa 
catgctggtg gcggaaccca ggaatctgtc cttcttcctg actccaccat gtgcacgatg 
ggctcagctt tcagaagtgc tgagttggca gttttcttct gtcaccaaaa gaggtctcaa 
tgtggaccag ctgaacatgt tgggagagaa gcttcttggt cctaacgcca gccccgatgg 1800 
tctcattccg tggacgaggt tttgtaagga aaatataaat gataaaaatt ttcccttctg 1860
gctttggatt gaaagcatcc tagaactcat taaaaaacac ctgctccctc tctggaatga 1920

tgggtgcatc atgggcttca tcagcaagga gcgagagcgt gccctgttga aggaccagca 1980
gccggggacc ttcctgctgc ggttcagtga gagctcccgg gaaggggcca tcacattcac 2040

atgggtggag cggtcccaga acggaggcga acctgacttc catgcggttg aaccctacac 2100 
gaagaaagaa ctttctgctg ttactttccc tgacatcatt cgcaattaca aagtcatggc 2160 
tgctgagaat attcctgaga atcccctgaa gtatctgtat ccaaatattg acaaagacca 2220 
tgcctttgga aagtattact ccaggccaaa ggaagcacca gagccaatgg aacttgatgg 2280 
ccctaaagga actggatata tcaagactga gttgatttct gtgtctgaag ttcacccttc 2340

tagacttcag accacagaca acctgctccc catgtctcct gaggagtttg acgaggtgtc 2400 

tcggatagtg ggctctgtag aattcgacag tatgatgaac acagtataga gcatgaattt 2460 

ttttcatctt ctctggcgac agttttcctt ctcatctgtg attccctcct gctactctgt 2520 

tccttcacat cctgtgtttc tagggaaatg aaagaaaggc cagcaaattc gctgcaacct 2580 

gttgatagca agtgaatttt tctctaactc agaaacatca gttactctga agggcatcat 2640 

gcatcttact gaaggtaaaa ttgaaaggca ttctctgaag agtgggtttc acaagtgaaa 2700 

aacatccaga tacacccaaa gtatcaggac gagaatgagg gtcctttggg aaaggagaag 2760 

ttaagcaaca tctagcaaat gttatgcata aagtcagtgc ccaactgtta taggttgttg 2820

gataaatcag tggttattta gggaactgct tgacgtagga acggtaaatt tctgtgggag 2880

aattcttaca tgttttcttt gctttaagtg taactggcag ttttccattg gtttacctgt 2940

gaaatagttc aaagccaagt ttatatacaa ttatatcagt cctctttcaa aggtagccat 3000

catggatctg gtagggggaa aatgtgtatt ttattacatc tttcacattg gctatttaaa 3060

gacaaagaca aattctgttt cttgagaaga gaatattagc tttactgttt gttatggctt 3120 

aatgacacta gctaatatca atagaaggat gtacatttcc aaattcacaa gttgtgtttg 3180 

atatccaaag ctgaatacat tctgctttca tcttggtcac atacaattat ttttacagtt 3240 

ctcccaaggg agttaggcta ttcacaacca ctcattcaaa agttgaaatt aaccatagat 3300 

gtagataaac tcagaaattt aattcatgtt tcttaaatgg gctactttgt cctttttgtt 3360 

attagggtgg tatttagtct attagccaca aaattgggaa aggagtagaa aaagcagtaa 3420 

ctgacaactt gaataataca ccagagataa tatgagaatc agatcatttc aaaactcatt 3480 

tcctatgtaa ctgcattgag aactgcatat gtttcgctga tatatgtgtt tttcacattt 3540 

gcgaatggtt ccattctctc tcctgtactt tttccagaca cttttttgag tggatgatgt 3600

ttcgtgaagt atactgtatt tttacctttt tccttcctta tcactgacac aaaaagtaga 3660

ttaagagatg ggtttgacaa ggttcttccc ttttacatac tgctgtctat gtggctgtat 3720

cttgtttttc cactactgct accacaacta tattatcatg caaatgctgt attcttcttt 3780

ggtggagata aagatttctt gagttttgtt ttaaaattaa agctaaagta tctgtattgc 3840

attaaatata atatcgacac agtgctttcc gtggcactgc atacaatctg aggcctcctc 3900

tctcagtttt tatatagatg gcgagaacct aagtttcagt tgattttaca attgaaatga 3960

ctaaaaaaca aagaagacaa cattaaaaac aatattgttt eta 4003



Homo sapiens ornithine decarboxylase (ODC1) mRNA, complete cds. 
/protein_id="AAA59966.2"

/translation="MNNFGNEEFDCHFLDEGFTAKDILDQKINEVSSSDDKDAFYVADL
OTILKKHIiRWLKM.PRVTPFYAVKC^^ 

PERIIYANPCKQVSQIKYAM^GVQMMTFDSEVE1.MKVARAHPKAKLVLRIATDDS^V 

crlsvkfgatlrtsf^li,erakei^idwgvsfhvgsgctdpetfvqaisdarc™g 

AEVGFSMYLIjDIGGGFPGSEDVKIjKFEEITGVINPAIjDKyFPSDSGVRIIAEPGRYyyA 

SAFTIiAVNI IAKKIVLKEQTGSDDEDES SEQTFMYYVNDGVYGS FNCHjYDHAHVKPIjIi 
QKRPKPDEKYYSSSIWGPTCDGI^RIVERCDLPEMHVGDW^FEl^GAYTVAAASTFNG 
FQRPTIYYVMSGPAWQLMQQFQNPDFPPEVEEQDASTLPVSCAWESGMKRHRAACASAS 

INV" 



Sequence 1815 BP; 485 A; 365 C; 448 G; 517 T; 0 other;
gaattcctgg agagttgcct ttgtgagaag ctggaaatat ttctttcaat tccatctctt 
Lttttccl? aggaacatca agaaatcatg aacaactttg gtaatgaaga gtttgactgc 
cacttcctcg atgaaggttt tactgccaag gacattctgg accagaaaat taatgaagtt 
tcttcttctg atgataagga tgccttctat gtggcagacc tgggagacat tctaaagaaa 
catctgagg? gg?taaaagc tctccctcgt gtcaccccct tttatgcagt caaatgtaat 
gatagcalag ccatcgtgaa gacccttgct gctaccggga caggatttga ctgtgctagc 
Lgactgaaa tacagttggt gcagagtctg ggggtgcctc cagagaggat tatctatgca 
aa?cct?gta aacaagtatc tcaaattaag tatgctgcta ataatggagt ^^agatgatg 
acttttgata gtgaagttga gttgatgaaa gttgccagag cacatcccaa ^gcaaagttg 
qttttgcgga ttgccactga tgattccaaa gcagtctgtc gtctcagtgt gaaattcggt 
lccaclc?ca gaaccagcag gctccttttg gaacgggcga aagagctaaa tatcgatg" 
Ltggtgtca gcttccatgt aggaagcggc tgtaccgatc ctgagacctt c 9tgcaggca 
atclltgatg cccgctgtgt ttttgacatg ggggctgagg ttggtttcag catgtatctg 
c«gatattg gcggtggc?t tcctggatct gaggatgtga aacttaaatt tgaagagatc 
accggcgtaa Laacccagc gttggacaaa tactttccgt cagactctgg agtgagaatc 
ataSSagc ccggcagata ctatgttgca tcagctttca cgcttgcagt taatatcatt 
gccaaglala ttglatLaa ggaacagacg ggctctgatg acgaagatga g^cgagtgag 
cagaccttta tg?attatgt gaatgatggc gtctatggat catttaattg catactctat 
aaccacgcac atgtaaagcc ccttctgcaa aagagaccta aaccagatga gaagtattat 
?catccagca ta^ggggLc aacatgtgat ggcctcgatc ggattgttga ^gctgtgac 
ctQcctgaaa tgcatgtggg tgattggatg ctctttgaaa acatgggcgc ttacactgtt 
gcScScct c?acg?tcaa tggcttccag aggccgacga tctactatgt ^tgtcaggg 
cctgcgLgc aactcatgca gcaattccag aaccccgact tcccacccga agtagaggaa 
caglScca gcaccctgcc ^gtgtcttgt gcctgggaga gtgggatgaa *cgccacaga 
gcagcc?gtg cttcggctag tattaatgtg tagatagcac tctggtagct gttaactgca 
agt^tagct? gaatlaaggg atttgggggg accatgtaac ttaattactg ctagttttga 
aatgtcfttg Lagagtagg gtcgccatga tgcagccata tggaagacta ^tatgggt 
cacacttatc tgtgttccta tggaaactat ttgaatattt gttttatatg S^ttttatt 
cactcttcag acacgctact caagagtgcc cctcagctgc tgaacaagca tttgtagctt 
gtacaatggc agaatgggcc aaaagcttag tgttgtgacc tgtttttaaa ataaagtatc 
ttgaaataat taggc 



Homo sapiens hephaestin (HEPH) mRNA, complete cds. 






.J ^CAQKRVAVmLFGMGNEIDWTAPFHGQMLTTRGHHTDVANIFPATFVTAEMVPWEP 

£ ^5f TT Q ^ SHPraGMQ ^ Y ™ CS ^ PWI ' LTGKVR QYFIEAHEIQ W DYGPMGS 

r T ^^^ GPTAQDPA ^ T ^ YPS ^ PI ^ TOSGLVGPL LVCRAGALGADGKQKGVDK 

S EFFLLFTVIJDENKSWYSNANQAAA^ 

Z ^T™™ LGLGTETO ^ GWFQGmQLQG ^ GA ^FPHTFVMAIMQPDNLG^ 

£ ^53wSo^^^^ EAGM ^ AI YNVS ^ CPG HQATPRQR YQAAR I YYI MAEEVEWD Y C PDRS WE 

£ f?^^f 3SyGyiFLS ^ GL ^ SR ™™™ TFRIPRP RTGPEE H SSG? 

4 L I KGEVGDI LTVVFKNNASRP YS VHAHGVLE STTVWPLAAEPGE VVTYOWNI PERS GPG 

'T ^^^ AraGSQDPGS1 ^ QDETF ^ SNI ^ raG ^AOT J RGLTMYQGERVAWY 

£ ^GQDVDMTIHFHAESFLYRNGENYRADVVDLFPGTFEVVEMVASNPGTWL^^ 

T LASVLVAIS VTLLLVVIiALGGVVWYQBRQRKIjRKNRRS ILDDS FKLLS FKQ " 



Sequence 4215 BP; 1066 A; 1000 C; 1077 G; 1072 T; 0 other;

cctgtttccc agagtaatgt gggccatgga gtcaggccac ctcctctggg ctctgctatt 
catgcagtcc ttgtggcctc aactgactga tggagccact cgagtctlc? ^ccSggfa^ 
ccgggatgtg cagtggaact atgctcccaa gggaagaaat gicatcacga Iccagcctct 
ggacagtgac atagtggctt ccagcttctt aaagtctgac Lgaaccggl Jagggggaac 
occta^t™ aCCatC ^ ata aagaatacaa ggatgactca tacacagSg aaSgccca 
gcctgcctgg ttgggcttcc tggggccagt gttgcaggct gaagtggggg atotcattct 
tattcacctg aagaattttg ccactcgtcc ctataccatc cacccKKg SScttcta 
taaSSS ^ Ct9aaS 9tt ccctataccc agatggctcc tctgggccac fglaagctga 
IclZlT^t f cccc 99999 gcagccatat ctacaactgg accattccag aaggccatgc 
llll^tt 9Ct9accca 9 cgtgcctcac ctggatctac cattctcatj taStgctcc 
acgagacatt gcaaotggcc taattgggcc tctcatcacc tgtaaaagag gaLcctgga 
SaaaS: ° CtCCtCaac 9«aggatg t agaccatgat tfcttccLc 5KS£gJ 
?ac a ? a S a I aacCtcagct 93oatctcaa tgagaacatt gccacttact gctcagatcc 
SacttSS f aCaaa3aa 9 atgagacatt tcaggagagc aataggatgc atgcaatcaJ 
tttgggaatt tacctgagct gaacatgtgt gcacagaaac gtgtggcctg 
* " Cat " 9Ca atgaaattga tgtccacaca icatt^ttcc LggfSaga? 
T " aCaCC acact 3 a tgt ggctaacatc tttccagcca cc??tgtgac 
tgctgagatg gtgccctggg aacctggtac ctggttaatt agctgccaag tgaacagtca 
ctttcgagat ggcatgcagg cactctacaa ggtcaagtct tgctccatgg ScctSctgt 
ggacctgctc acaggcaaag ttcgacagta cttcat?gag gSccatgagl ttcaatggX 

c^caStaao £????"a? t9 " aa9aat ttgaga^agc caggcag^a? 

atatf 3 ^ 39 "" tcca9a agagctccag ccgaattggg ggcacttact ggaaagtgcg 
S£S^ 9 * t " caa 9 at 9 agacattcca agagaagatg catttggagg aagatagLa 
tcttggaatc ctggggccag tgatccgggc tgaggtgggt gacacclttl aggWctt 
ctacaaccgt gcctcccagc cattcagcat gcagccccat ggggtctttt atSataaoa 
ctatgaaggc actgtgtaca atgatggctc atcftaccct ggJSgg^g cclagccc? 3 
ta a S aa9 J a acatacc 3 ct 9gacagtccc ccctcatgcc ggtccclctg ctcagg^cc 
cctao?^ aCtt ? 9at9t acttctctgc tgcagatccc ataagagacl caaa??ctgg 
!oS2" 9 ^ CC9Ct9ct " t3tgcagggc tggtgccttg ggtgcagatg gcaagcagS 
aggggtggat aaagaattct ttcttctctt cactgtgttg gatgagaaca agagctggta 
SEJ 9CC aat ^ aa9ca9 <=tgctatgtt ggatttccgl ctgct?tcag aggltSga 
claJS 9ac 5 CCaatc ggatgcatgc cattaatggg tttctgttcf. cILcctgfc 
caggctggac atgtgcaagg gtgacacagt ggcctggcac ctgctcggcc tgggcaclga 
gactgatgtg catggagtca tgttccaggg caacactgtg cagcttcagg gcatgaggaa
gggtgcagct atgctctttc ctcatacctt tgtcatggcc atcatgcagc ctgacaacct 
tgggacattt gagatttatt gccaggcagg cagccatcga gaagcaggga tgagggcaat 

ctataatgtc tcccagtgtc ctggccacca agccacccct cgccaacgct accaagctgc 2220 

aagaatctac tatatcatgg cagaagaagt agagtgggac tattgccctg accggagctg 2280
ggaacgggaa tggcacaacc agtctgagaa ggacagttat ggttacattt tcctgagcaa
caaggatggg ctcctgggtt ccagatacaa gaaagctgta ttcagggaat acactgatgg
tacattcagg atccctcggc caaggactgg accagaagaa cacttgggaa tcttgggtcc

acttatcaaa ggtgaagttg gtgatatcct gactgtggta ttcaagaata atgccagccg 2520

cccctactct gtgcatgctc atggagtgct agaatctact actgtctggc cactggctgc 2580

tgagcctggt gaggtggtca cttatcagtg gaacatccca gagaggtctg gccctgggcc 2640 

caatgactct gcttgtgttt cctggatcta ttattctgca gtggatccca tcaaggacat 2700 

gtatagtggc ctggtggggc ccttggctat ctgccaaaag ggcatcctgg agccccatgg 2760 

aggacggagt gacatggatc gggaatttgc attgttgttc ttgatttttg atgaaaataa 2820

gtcttggtat ttggaggaaa atgtggcaac ccatgggtcc caggatccag gcagtattaa 2880 

cctacaggat gaaactttct tggagagcaa taaaatgcat gcaatcaatg ggaaactcta 2940 

tgccaacctt aggggtctta ccatgtacca aggagaacga gtggcctggt acatgctggc 3000

catgggccaa gatgtggatc tacacaccat ccactttcat gcagagagct tcctctatcg 3060

gaatggcgag aactaccggg cagatgtggt ggatctgttc ccagggactt ttgaggttgt 3120 

ggagatggtg gccagcaacc ctgggacatg gctgatgcac tgccatgtga ctgaccatgt 3180 

ccatgctggc atggagaccc tcttcactgt tttttctcga acagaacact taagccctct 3240 

caccgtcatc accaaagaga ctgaaaaagc agtgcccccc agagacattg aagaaggcaa 3300 

tgtgaagatg ctgggcatgc agatccccat aaagaatgtt gagatgctgg cctctgtttt 3360 

ggttgccatt agtgtcaccc ttctgctcgt tgttctggct cttggtggag tggtttggta 3420 

ccaacatcga cagagaaagc tacgacgcaa taggaggtcc atcctggatg acagcttcaa 3480 

gcttctgtct ttcaaacagt aacatctgga gcctggagat atcctcagga agcacatctg 3540 

tagtgcactc ccagcaggcc atggactagt cactaacccc acactcaaag gggcatgggt 3600 

ggtggagaag cagaaggagc aatcaagctt atctggatat ttctttcttt atttatttta 3660 

catggaaata atatgatttc actttttctt tagtttcttt gctctacgtg ggcacctggc 3720 
actaagggag taccttatta tcctacatcg caaatttcaa cagctacatt atatttcctt 3780
ctgacacttg gaaggtattg aaatttctag aaatgtatcc ttctcacaaa gtagagacca 
agagaaaaac tcattgattg ggtttctact tctttcaagg actcaggaaa tttcactttg 
aactgaggcc aagtgagctg ttaagataac ccacacttaa actaaaggct aagaatatag 
gcttgatggg aaattgaagg taggctgagt attgggaatc caaattgaat tttgattctc 
cttggcagtg aactactttg aagaagtggt caatgggttg ttgctgccat gagcatgtac 
aacctctgga gctagaagct cctcaggaaa gccagttctc caagttctta acctgtggca 
ctgaaaggaa tgttgagtta cctcttcatg ttttagacag caaaccctat ccattaaagt 
acttgttaga acact 
Human 18S rRNA gene, complete.

18S ribosomal RNA; ribosomal RNA.



ccgtcctcga g c c c c c nnnii 60
ctacctggtt gatcctgcca 120
aagtiacgcac aa c caa fc. a c a 180
ttggtcgctc gctcctctcc 240
gacgggcgct gacccccttc 300
ggtcagcccc tctccggccc 360
taacctcoqq ccoatcacac 420
atcaactttc ciataotaatr: 480
ggttcgattc ccro aaacr crcr a 540
gcgcaaatta cccactcccg 600
tttcgaggcc ctataat taa 660
gggcaaatct Qatciccaar a 720
ctgcagttaa aaaGctcota 780
agccaccgcc cqtccccacc 840
tcccqcaqqq cccqaaqcat 900
cgcctggata ccacaactaa 960
cagaactcraQ qccataatta 1020
tgaaattctt oaaccaacac 1080
attaatcaag aacaaaaotc 1140
ataaacgatg ccgaccggcg 1200
gggaaaccaa agtctttggg 1260
attgacggaa gggcaccacc 1320
acctcacccg gcccggacac 1380
gggtggtggt gcatggccgt 1440
cgaacgagac tctggcatgc 1500
cttcttagag ggacaagtgg 1560
gcccttagat gtccggggct 1620
tacgccggca ggcgcgggta 1680
attattcccc atgaacgagg 1740
gtccctgccc tttgtacaca 1800
ctcggatcgg ccccgccggg 1860
acttgactat ctagaggaag 1920
ggatcatta 1969



Homo sapiens cell death regulator aven mRNA, complete cds 



/translation="MQAERGARGGRGRRPGRGRPGGDRHSERPGAAAAVARGGGGGGGG
DGGGRRGRGRGRGFRGARGGRGGGGAPRGSRREPGGWGAGASAPVEDDSDAETYGEEND
EQGNYSKRKIVSNWDRYQDIEKEVNNESGESQRGTDFSVLLSSAGDSFSQFRFAEEKEW
DSEASCPKQNSAFYVDSELLVRALQELPLCLRLNVAAELVQGTVPLEVPQVKPKRTDDG
KGLGMQLKGPLGPGGRGPIFELKSVAAGCPVLLGKDNPSPGPSRDSQKPTSPLQSAGDH
LEEELDLLLNLDAPIKEGDNILPDQTSQDLKSKEDGEVVQEEEVCAKPSVTEEKNMEPE
QPSTSKNVTEEELEDWLDSMIS"



Sequence 1549 BP; 415 A; 349 C; 469 G; 314 T; 2 other; 
gggcgtctcc gcagctcggc tcccgcgcgc tcagcaccac cagcggcgcc agatgcaggc 
ggagcgagga gctcggggag gccgtgggcg gcggccaggc cgcggccggc ctggcggaga 
tcgccacagc gagcggcccg gagccgcagc ggcggtagcc agaggcggcg gcggaggcgg 
cggcggggac ggaggcggac gccggggccg tggccgtggc cggggcttcc gcggcgctcg 
cggaggccga ggaggaggag gcgccccgcg aggcagccgc cgggagccgg gaggctgggg 
cgcaggggcc agcgcgccgg ttgaagatga cagcgatgca gagacctatg gagaagagaa 
tgatgaacag ggaaattatt ctaaaagaaa gattgtctct aactgggatc gatatcaaga 
tattgaaaaa gaggtcaata atgaaagtgg agagtcacag aggggaacag atttcagtgt 
cctccttagc tctgcagggg actcattctc acagttccgg tttgctgagg agaaagaatg 
ggatagtgaa gcttcttgtc caaaacagaa ttcagcattt tatgtggata gtgagttatt 
ggttcgagcc cttcaagagc tgcctctctg cctccgactc aacgttgctg ccgaactggt 
ccagggtaca gttcctttag aggttcctca ggtgaaacca aagagaactg atgatggcaa 
gggattaggg atgcagttaa aggggccctt ggggcctgga ggaagggggc ccatctttga 
gctgaaatct gtggctgctg gctgccctgt gttgctgggc aaagacaacc caagcccggg 
tccttcaagg gattctcaga aacccacttc cccactgcag tcagcaggag accatttgga 
agaagaacta gatctgttgc ttaatttaga tgcacctata aaagagggag ataacatctt 
accagatcag acgtctcagg acctgaaatc caaggaagat ggggaggtgg tccaagagga 
agaagtttgt gcaaaaccat ctgtgactga agaaaaaaac atggaacctg agcaaccaag 
tacctccaaa aatgttaccg aggaagagct ggaagactgg ttggacagca tgatttccta 
aaaaggggga aaaaagtgcc tgaagcaaat cttggttgcc ttctaacggc aggtgggcat 
aaggctgtcc ttcaggacca gccagtttac aagcatgtct caagctagtg tgttccatta 
tgctcacagc agtaaatgcc tacctctgtg tttgacatct gaaagaatac attgaagcag 
cttgttgcat ttgtttttct ggcttagtaa tctaatagat ttccttaagg gcaggagata 
gactctggcc cttgtttcta gcctccttcc ttgcagtgtt tacaacatag ccagtgttta 
cagcatagca gatgctgctg ctggttaaga gaatagatgc aaacaaggca tgcatttggc 
caaaataaac aaatgctggt ctgtccaaaa aannaaaaaa aaaaaaaaa 
Homo sapiens interferon, gamma-inducible protein 16, mRNA (cDNA clone
MGC:9466 IMAGE:3914632), complete cds.

/translation="MGKKYKNIVLLKGLEVINDYHFRMVKSLLSNDLKLNLKMREEYDK

A IQIADLMEEKFRGDAGLGKLIKIFEDIPTLEDLAETLKKEKLKVKGPALSRKRKKEVDA 

TSPAPSTSSTVKTEGAEATPGAQKRKKSTKEKAGPKGSKVSEEQTQPPSPAGAGMSTAM 
GRSPSPKTSLSAPPNTSSTENPKTVAKCQVTPRRNVLQKRPVIVKVIjSTTKPPEYETPE 
MEKXIMFHATVATQTQFFHVKVIjNTSLKEKFNGKKZIIISDYIjEYDSLIjEVNEESTVSE 
AGPNQTFEVPNKIINPJOCETLKIDILHTCQASGNIVYGVFMLHKKTVNQKTTIYEIQDDR 
GKTffiVVGTGQCHNIPCEEGDKLQLFCFRIiRKKNQMSIOjISEMHSFIQIKKKTNPRNNDP 
KSMKLPQEQSQLPNPSEASTTFPESHIjRTPQMPPTTPSSSFFTKKSEDTISKMNDFMRM 

qilkegshfpgpfmtsigpaeshphtpqmppstpsssflttlkprlktepeevsiedsa 

QSDLKEVMVLNATES FVYEPKEQKKMFHATVATENEVFRVKVFNIDLKEKFTPKKI I AI 
ANYVCRNGFLEVY PFTLVADVNADRNME I PKGLIRS AS VTPKINQLCSQTKGS FVNGVF 

EVHKTO^RGEFTYYEIQDNTGKMEVVVHGRIjTTINCEEGDKIjKLTCFEIiAPKSGNTGEL 
RSVIHSHIKVIKTRKNKKD1LNPDSSMETSPDFFF » 

Sequence 2709 BP; 964 A; 541 C; 544 G; 660 T; 0 other; 

gcagaatagg agcaagccag cactagtcag ctaactaagt gactcaacca aggccttttt 60 

tccttgttat ctttgcagat acttcatttt cttagcgttt ctggagatta caacatcctg 120

cggttccgtt tctgggaact ttactgattt atctcccccc tcacacaaat aagcattgat 180 

tcctgcattt ctgaagatct caagatctgg actactgttg aaaaaatttc cagtgaggct 240 

cacttatgtc tgtaaagatg ggaaaaaaat acaagaacat tgttctacta aaaggattag 300

aggtcatcaa tgattatcat tttagaatgg ttaagtcctt actgagcaac gatttaaaac 360 
ttaatttaaa aatgagagaa gagtatgaca aaattcagat tgctgacttg atggaagaaa 
agttccgagg tgatgctggt ttgggcaaac taataaaaat tttcgaagat ataccaacgc 
ttgaagacct ggctgaaact cttaaaaaag aaaagttaaa agtaaaagga ccagccctat 

caagaaagag gaagaaggaa gtggatgcta cttcacctgc accctccaca agcagcactg 600 

tcaaaactga aggagcagag gcaactcctg gagctcagaa aagaaaaaaa tcaaccaaag 660 

aaaaggctgg acccaaaggg agtaaggtgt ccgaggaaca gactcagcct ccctctcctg 720 

caggagccgg catgtccaca gccatgggcc gttccccatc tcccaagacc tcattgtcag 780 

ctccacccaa cacttcttca actgagaacc cgaaaacagt ggccaaatgt caggtaactc 840 

ccagaagaaa tgttctccaa aaacgcccag tgatagtgaa ggtactgagt acaacaaagc 900 

catttgaata tgagacccca gaaatggaga aaaaaataat gtttcatgct acagtggcta 960 

cacagacaca gttcttccat gtgaaggttt taaacaccag cttgaaggag aaattcaatg 1020 

gaaagaaaat catcatcata tcagattatt tggaatatga tagtctccta gaggtcaatg 1080 

aagaatctac tgtatctgaa gctggtccta accaaacgtt tgaggttcca aataaaatca 1140 

tcaacagagc aaaggaaact ctgaagattg atattcttca caaacaagct tcaggaaata 1200 
ttgtatatgg ggtatttatg ctacataaga aaacagtaaa tcagaagacc acaatctacg 1260
aaattcagga tgatagagga aaaatggatg tagtggggac aggacaatgt cacaatatcc 1320
cctgtgaaga aggagataag ctccaacttt tctgctttcg acttagaaaa aagaaccaga 1380

tgtcaaaact gatttcagaa atgcatagtt ttatccagat aaagaaaaaa acaaacccga 1440

gaaacaatga ccccaagagc atgaagctac cccaggaaca gagtcagctt ccaaatcctt 1500 

cagaggccag cacaaccttc cctgagagcc atcttcggac tcctcagatg ccaccaacaa 1560 
ctccatccag cagtttcttc accaagaaaa gtgaagacac aatctccaaa atgaatgact 1620
tcatgaggat gcagatactg aaggaaggga gtcattttcc aggaccgttc atgaccagca 1680
taggcccagc tgagagccat ccccacactc ctcagatgcc tccatcaaca ccaagcagca 1740

gtttcttaac cacgttgaaa ccaagactga agactgaacc tgaagaagtt tccatagaag 1800

acagtgccca gagtgacctc aaagaagtga tggtgctgaa cgcaacagaa tcatttgtat 1860

atgagcccaa agagcagaag aaaatgtttc atgccacagt ggcaactgag aatgaagtct 1920

tccgagtgaa ggtttttaat attgacctaa aggagaagtt caccccaaag aagatcattg 1980 

ccatagcaaa ttatgtttgc cgcaatgggt tcctggaggt atatcctttc acacttgtgg 2040 

ctgatgtgaa tgctgaccga aacatggaga tcccaaaagg attgattaga agtgccagcg 2100 

taactcctaa aatcaatcag ctttgctcac aaactaaagg aagttttgtg aatggggtgt 2160 

ttgaggtaca taagaaaaat gtaaggggtg aattcactta ttatgaaata caagataata 2220 

cagggaagat ggaagtggtg gtgcatggac gactgaccac aatcaactgt gaggaaggag 2280 

ataaactgaa actcacctgc tttgaattgg caccgaaaag tgggaatacc ggggagttga 2340 
gatctgtaat tcatagtcac atcaaggtca tcaagaccag gaaaaacaag aaagacatac 2400

tcaatcctga ttcaagtatg gaaacttcac cagacttttt cttctaaaat ctggatgtca 2460 

ttgacgataa tgtttatgga gataaggtct aagtgcctaa aaaaatgtac atatacctgg 2520 

ttgaaataca acactataca tacacaccac catatatact agctgttaat cctatggaat 2580 

ggggtattgg gagtgctttt ttaatttttc atagtttttt tttaataaaa tggcatattt 2640

tacatctaca acttctataa tttgaaaaaa taaataaaca ttatcttttt tgtgaaaaaa 2700 

aaaaaaaaa 2709



Homo sapiens guanylate binding protein 1, interferon-inducible, 67kDa, mRNA
(cDNA clone MGC:3949 IMAGE:3606865), complete cds.

/protein_id="AAH02666.1"

/translation="MASEIHMTGPMCLIENTNGRLMANPEALKILSAITQPMVVVAIVG
L YRTGKS YLMNKLAGKKKGPSLGSTVQSHTKG I WMWCVPHPKKPGH ILVLLDTEGLGD V 
EKGDNQNDS WI FALAVLLS STFVYNS IGTINQQAMDQLYYVTELTHRIRSKSSPDENEN 
EVEDSADFVSFFPDFVWTLRDFSLDLEADGQPLTPDEYLTYSLKLKKGTSQKDETFNLP 
RLCIRKFFPKKKCFVPDRPVHRRKIAQLEKLQDEELDPEFVQQVADFCSYIFSNSKTKT 
LSGG1QVNGPRLESLVLTYWAISSGDLPCMENAVIALAQIENSAAVQKAIAHYEQQMG 
QKVQLPTESLQEIi1jDIjH3U3SEREA1EVFIRSSFKDVDHLFQKEIjAAQIjEKKRDDFCKQN 
QEASSDRCSGLLQVIFSPLEEEVKAGIYSKPGGyRLFVQKLQDLKKKYYEEPRKGIQAE 
EI^TYLKSKESMTOAILQTOQTLTEKEKEIEVERVKAESAQASAKMLQEMQRKNEQMM 

EQKERSYQEHLKQLTEKMENDRVQLIiKEQERTIiAIiKLQEQEQLLKEGFQKESRIMKNEI 
QDLQTKMRRRKACTIS" 



ggagtcagtg atttgaacga agtactttca gtttcatatt actctaaatc cattacaaat 
ctgcttagct tctaaatatt tcatcaatga ggaaatccca gccctacaac ttcggaacag 
tgaaatatta gtccagggat ccagtgagag acacagaagt gctagaagcc agtgctcgtg 
aactaaggag aaaaagaaca gacaagggaa cagcctggac atggcatcag agatccacat 
gacaggccca atgtgcctca ttgagaacac taatgggcga ctgatggcga aLcagaagc 
tctgaagatc ctttctgcca ttacacagcc tatggtggtg gtggcaattg tgggcctcla 
ccgcacaggc aaatcctacc tgatgaacaa gctggctggl Lglaaaagg gfSctctct 
gggctccacg gtgcagtctc acactaaagg aatctggatg tggtgtgtgc cccaccccaa 
f«™S- CaCatC f ta 9 ttctgctgga caccgagggt ctgggagatg tagagaaggg 
tgacaaccag aatgactcct ggatcttcgc cctggccgtc ctcctgagca gcaccttci? 
gtacaatagc ataggaacca tcaaccagca ggctatggac caactgtact atgtgacaga 
gctgacacat agaatccgat caaaatcctc acctgatgag aatgagaatg aggttgagga 
ttcagctgac tttgtgagct tcttcccaga ctttgtgtgg acactgagag aStcLSS 
ggacttggaa gcagatggac aacccctcac accagatgag tacctgacat actccctgaa 
gctgaagaaa ggtaccagtc aaaaagatga aacttttaac ctgcccagac tctgtatccg 
gaaattcttc ccaaagaaaa aatgctttgt ctttgatcgg cccgttcacc gcaggaagct 
tgcccagctc gagaaactac aagatgaaga gctggacccc gaatttgtgc aacaagtagc 
agacttctgt tcctacatct ttagtaattc caaaactaaa actcttLag gaggcXcS 
ggtcaacggg cctcgtctag agagcctggt gctgacctac gtcaatgcca tcagcagtgg 
ggatctgccg tgcatggaga acgcagtcct ggccttggcc cagatagaga actcagctgc 
!^f! aaaa9 SCtattgccc actatgaaca gcagatgggc cagaaggtgc agctgcccac 
agaaagcctc caggagctgc tggacctgca cagggacagt gagagagagg ccattgaagt 
cttcatcagg agttccttca aagatgtgga ccatctattt caaaaggal? tagcglcSca 
SSSUTE" aa 3°gggatg acttttgtaa acagaatcag gaagcatcat cagatcgttg 
ctcaggttta cttcaggtca ttttcagtcc tctagaagaa gaagtgaagg cgggaattta 
ttcgaaacca gggggctatc gtctctttgt tcagaagcta caagacctga agaaaaagta 
ctatgaggaa ccgaggaagg ggatacaggc tgaagagatt ctgcagacat acttgaaatc 

^gactgatg caattctcca gacagaccag actctcacag aaaaagaaaa 1680 
t 9 9t 9 f 9 aac 9 t 9 tgaaagctga gtctgcacag gcttcagcaa aaatgttgca 1740 
ggaaatgcaa agaaagaatg agcagatgat ggaacagaag gagaggagtt atcaggaaca 1800 
cttgaaacaa ctgactgaga agatggagaa cgacagggtc cagttgctga aagagcaaga 
gaggaccctc gctcttaaac ttcaggaaca ggagcaacta ctaaaagagg gatttcaaaa 
agaaagcaga ataatgaaaa atgagataca ggatctccag acgaaaatga gacgacgaaa 
ggcatgtacc ataagctaaa gaccagagcc ttcctgtcac ccctaaccaa ggcataattg 
aaacaatttt agaatttgga acaagcgtca ctacatttga taataattag atcttgcatc 
ataacaccaa aagtttataa aggcatgtgg tacaatgatc aaaatcatgt tttttcttaa 
aaaaaaaaaa aaaaaa 



Homo sapiens interferon induced transmembrane protein 1 (9-27), mRNA (cDNA
clone MGC:5195 IMAGE:3464598), complete cds.

/translation="MHKEEHEVAVLGAPPSTILPRSTVINIHSETSVPDHVVWSLFNTL
FLNWCCLGFIAFAYSVKSRDRKMVGDVTGAQAYASTAKCLNIWALILGILMTIGFILLL
VFGSVTVYHIMLQIIQEKRGY"

aaacgacagg ggaaaggagg tctcactgag caccgtccca gcatccggac accacagcgg 60
cccttcgctc cacgcagaaa accacacttc tcaaaccttc actcaacact tccttcccca 120
aagccagaag atgcacaagg aggaacatga ggtggctgtg ctgggggcac cccccagcac 180
catccttcca aggtccaccg tgatcaacat ccacagcgag acctccgtgc ccgaccatgt 240
cgtctggtcc ctgttcaaca ccctcttctt gaactggtgc tgtctgggct tcatagcatt 300
cgcctactcc gtgaagtcta gggacaggaa gatggttggc gacgtgaccg gggcccaggc 360
ctatgcctcc accgccaagt gcctgaacat ctgggccctg attctgggca tcctcatgac 420
cattggattc atcctgttac tggtattcgg ctctgtgaca gtctaccata ttatgttaca 480
gataatacag gaaaaacggg gttactagta gccgcccata gcctgcaacc tttgcactcc 540
actgtgcaat gctggccctg cacgctgggg ctgttgcccc tgcccccttg gtcctgcccc 600
tagatacagc agtttatacc cacacacctg tctacagtgt cattcaataa agtgcacgtg 660
cttgtgaaaa aaaaaaaaaa aaa 683






Homo sapiens transcription factor ISGF-3 mRNA, complete cds. 






/translation="MSQWYELQQLDSKFLEQVHQLYDDSFPMEIRQYLAQWLEKQDWEH
AANDVSFATIRFHDLLSQLDDQYSRFSLENNFLLQHNIRKSKRNLQDNFQEDPIQMSMI
IYSCLKEERKILENAQRFNQAQSGNIQSTVMLDKQKELDSKVRNVKDKVMCIEHEIKSL
EDLQDEYDFKCKTLQNREHETNGVAKSDQKQEQLLLKKMYLMLDNKRKEVVHKIIELLN
VTELTQNALINDELVEWKRRQQSACIGGPPNACLDQLQNWFTIVAESLQQVRQQLKKLE
ELEQKYTYEHDPITKNKQVLWDRTFSLFQQLIQSSFVVERQPCMPTHPQRPLVLKTGVQ
FTVKLRLLVKLQELNYNLKVKVLFDKDVNERNTVKGFRKFNILGTHTKVMNMEESTNGS
LAAEFRHLQLKEQKNAGTRTNEGPLIVTEELHSLSFETQLCQPGLVIDLETTSLPVVVI
SNVSQLPSGWASILWYNMLVAEPRNLSFFLTPPCARWAQLSEVLSWQFSSVTKRGLNVD
QLNMLGEKLLGPNASPDGLIPWTRFCKENINDKNFPFWLWIESILELIKKHLLPLWNDG
CIMGFISKERERALLKDQQPGTFLLRFSESSREGAITFTWVERSQNGGEPDFHAVEPYT
KKELSAVTFPDIIRNYKVMAAENIPENPLKYLYPNIDKDHAFGKYYSRPKEAPEPMELD
GPKGTGYIKTELISVSEVHPSRLQTTDNLLPMSPEEFDEVSRIVGSVEFDSMMNTV"



Sequence 4003 BP; 1173 A; 812 C; 883 G; 1135 T; 0 other;

attaaacctc tcgccgagcc cctccgcaga ctctgcgccg gaaagtttca tttgctgtat 60
gccatcctcg agagctgtct aggttaacgt tcgcactctg tgtatataac ctcgacagtc 120
ttggcaccta acgtgctgtg cgtagctgct cctttggttg aatccccagg cccttgttgg 180 
ggcacaaggt ggcaggatgt ctcagtggta cgaacttcag cagcttgact caaaattcct 240 
ggagcaggtt caccagcttt atgatgacag ttttcccatg gaaatcagac agtacctggc 300
acagtggtta gaaaagcaag actgggagca cgctgccaat gatgtttcat ttgccaccat 360
ccgttttcat gacctcctgt cacagctgga tgatcaatat agtcgctttt ctttggagaa 420 
taacttcttg ctacagcata acataaggaa aagcaagcgt aatcttcagg ataattttca 480 
lllllTaT* atcca 9 at 9 t ctatgatcat ttacagctgt ctgaaggaag aaaggaaaat 540 
tctggaaaac gcccagagat ttaatcaggc tcagtcgggg aatattcaga gcacagtgat 600 
gttagacaaa cagaaagagc ttgacagtaa agtcagaaat gtgaaggaca aggttatgtg 660 
tatagagcat gaaatcaaga gcctggaaga tttacaagat gaatatgact tcaaatgcaa 720
aaccttgcag aacagagaac acgagaccaa tggtgtggca aagagtgatc agaaacaaga 780
acagctgtta ctcaagaaga tgtatttaat gcttgacaat aagagaaagg aagtagttca 840
caaaataata gagttgctga atgtcactga acttacccag aatgccctga ttaatgatga 900
actagtggag tggaagcgga gacagcagag cgcctgtatt ggggggccgc ccaatgcttg 960
cttggatcag ctgcagaact ggttcactat agttgcggag agtctgcagc aagttcggca 1020
gcagcttaaa aagttggagg aattggaaca gaaatacacc tacgaacatg accctatcac 1080
aaaaaacaaa caagtgttat gggaccgcac cttcagtctt ttccagcagc tcattcagag 1140
ctcgtttgtg gtggaaagac agccctgcat gccaacgcac cctcagaggc cgctggtctt 1200
gaagacaggg gtccagttca ctgtgaagtt gagactgttg gtgaaattgc aagagctgaa 1260
ttataatttg aaagtcaaag tcttatttga taaagatgtg aatgagagaa atacagtaaa 1320
aggatttagg aagttcaaca ttttgggcac gcacacaaaa gtgatgaaca tggaggagtc 1380
caccaatggc agtctggcgg ctgaatttcg gcacctgcaa ttgaaagaac agaaaaatgc 1440
tggcaccaga acgaatgagg gtcctctcat cgttactgaa gagcttcact cccttagttt 1500 
tgaaacccaa ttgtgccagc ctggtttggt aattgacctc gagacgacct ctctgcccgt 1560
tgtggtgatc tccaacgtca gccagctccc gagcggttgg gcctccatcc tttggtacaa 1620
catgctggtg gcggaaccca ggaatctgtc cttcttcctg actccaccat gtgcacgatg 1680
ggctcagctt tcagaagtgc tgagttggca gttttcttct gtcaccaaaa gaggtctcaa 1740
tgtggaccag ctgaacatgt tgggagagaa gcttcttggt cctaacgcca gccccgatgg 1800
tctcattccg tggacgaggt tttgtaagga aaatataaat gataaaaatt ttcccttctg 1860
gctttggatt gaaagcatcc tagaactcat taaaaaacac ctgctccctc tctggaatga 1920
tgggtgcatc atgggcttca tcagcaagga gcgagagcgt gccctgttga aggaccagca 1980
gccggggacc ttcctgctgc ggttcagtga gagctcccgg gaaggggcca tcacattcac 2040
atgggtggag cggtcccaga acggaggcga acctgacttc catgcggttg aaccctacac 2100
gaagaaagaa ctttctgctg ttactttccc tgacatcatt cgcaattaca aagtcatggc 2160
tgctgagaat attcctgaga atcccctgaa gtatctgtat ccaaatattg acaaagacca 2220
tgcctttgga aagtattact ccaggccaaa ggaagcacca gagccaatgg aacttgatgg 2280
ccctaaagga actggatata tcaagactga gttgatttct gtgtctgaag ttcacccttc 2340
tagacttcag accacagaca acctgctccc catgtctcct gaggagtttg acgaggtgtc 2400



tcggatagtg ggctctgtag aattcgacag tatgatgaac acagtataga gcatgaattt 2460 
ttttcatctt ctctggcgac agttttcctt ctcatctgtg attccctcct gctactctgt 2520 
tccttcacat cctgtgtttc tagggaaatg aaagaaaggc cagcaaattc gctgcaacct 2580 
gttgatagca agtgaatttt tctctaactc agaaacatca gttactctga agggcatcat 2640
gcatcttact gaaggtaaaa ttgaaaggca ttctctgaag agtgggtttc acaagtgaaa 2700
aacatccaga tacacccaaa gtatcaggac gagaatgagg gtcctttggg aaaggagaag 2760
ttaagcaaca tctagcaaat gttatgcata aagtcagtgc ccaactgtta taggttgttg 2820
gataaatcag tggttattta gggaactgct tgacgtagga acggtaaatt tctgtgggag 2880
aattcttaca tgttttcttt gctttaagtg taactggcag ttttccattg gtttacctgt 2940
gaaatagttc aaagccaagt ttatatacaa ttatatcagt cctctttcaa aggtagccat 3000
catggatctg gtagggggaa aatgtgtatt ttattacatc tttcacattg gctatttaaa 3060
gacaaagaca aattctgttt cttgagaaga gaatattagc tttactgttt gttatggctt 3120
aatgacacta gctaatatca atagaaggat gtacatttcc aaattcacaa gttgtgtttg 3180
atatccaaag ctgaatacat tctgctttca tcttggtcac atacaattat ttttacagtt 3240
ctcccaaggg agttaggcta ttcacaacca ctcattcaaa agttgaaatt aaccatagat 3300
gtagataaac tcagaaattt aattcatgtt tcttaaatgg gctactttgt cctttttgtt 3360
attagggtgg tatttagtct attagccaca aaattgggaa aggagtagaa aaagcagtaa 3420
ctgacaactt gaataataca ccagagataa tatgagaatc agatcatttc aaaactcatt 3480
tcctatgtaa ctgcattgag aactgcatat gtttcgctga tatatgtgtt tttcacattt 3540
gcgaatggtt ccattctctc tcctgtactt tttccagaca cttttttgag tggatgatgt 3600
ttcgtgaagt atactgtatt tttacctttt tccttcctta tcactgacac aaaaagtaga 3660
ttaagagatg ggtttgacaa ggttcttccc ttttacatac tgctgtctat gtggctgtat 3720
cttgtttttc cactactgct accacaacta tattatcatg caaatgctgt attcttcttt 3780
ggtggagata aagatttctt gagttttgtt ttaaaattaa agctaaagta tctgtattgc 3840
attaaatata atatcgacac agtgctttcc gtggcactgc atacaatctg aggcctcctc 3900
tctcagtttt tatatagatg gcgagaacct aagtttcagt tgattttaca attgaaatga 3960
ctaaaaaaca aagaagacaa cattaaaaac aatattgttt cta 4003



Homo sapiens phospholipid scramblase 1, mRNA (cDNA clone IMAGE:4253596),
complete cds.



/translation="MDKQNSQMNASHPETNLPVGYPPQYPPTAFQGPPGYSGYPGPQVS
YPPPPAGHSGPGPAGFPVPNQPVYNQPVYNQPVGAAGVPWMPAPQPPLNCPPGLEYLSQ
VISKTQNTHKKQNCASSLLNQISK"

Sequence 1143 BP; 370 A; 241 C; 217 G; 315 T; 0 other ; 

gagaaggttg cgcagcagct gtgcccggca gtctagaggc gcagaagagg aagccatcgc 60 

ctggccccgg ctctctggac cttgtctcgc tcgggagcgg aaacagcggc agccagagaa 120 

ctgttttaat catggacaaa caaaactcac agatgaatgc ttctcacccg gaaacaaact 180 

tgccagttgg gtatcctcct cagtatccac cgacagcatt ccaaggacct ccaggatata 240

gtggctaccc tgggccccag gtcagctacc cacccccacc agccggccat tcaggtcctg 300 

gcccagctgg ctttcctgtc ccaaatcagc cagtgtataa tcagccagta tataatcagc 360 

cagttggagc tgcaggggta ccatggatgc cagcgccaca gcctccatta aactgtccac 420 

ctggattaga atatttaagt caggtaattt caaagacaca aaatactcat aaaaaacaga 480 

actgtgcttc cagcttgctt aaccagatta gcaaatgaat aattcaccaa agtctgaaat 540 

agcaaaactg tatttcctgc taacagatta ctctaattct tctaggtctg gttcaatttt 600 

aaagcaaaat acaaatgcct tagaaaattg tattttctgt tatcttaaat acaatctatg 660 

ataatggcca atagcaaaca tttaattagc actgtttcct gcctttgttg tatgcctgat 720 

acatgtatta actcatttaa tccttattga aagtctgtga tgtataggtg ctacattttt 780 

caaaagaaga aacagaggtc cagagaggtt atatagctca ctctggggtg agaacctaaa 840 

gagtcaagac tgttttttta atcccgaaac tttggtactg agcgaagtgc tctttagtct 900 
caatactgaa taattgcctt ataatttgga agaaaattta aataaagttt attgttgagt 960
ttcaataagt ggccccaaac aaggggttaa tattttatgt gtaatatgac tcaccttttt 1020
attgtaacta ataaaactgc atttttatga tgctgctttt gttctttgga agacctaatt 1080
ttataaatgc cattaataaa ggagtaaaaa gccaaaaaaa aaaaaaaaaa aaaaaaaaaa 1140
aaa 1143

Homo sapiens metalloprotease disintegrin cysteine-rich protein, secreted
form mRNA, complete cds.

/translation "NO^QGLLPVSLIiL^^ 

EPEQQEQFETELKYKMTINGKIAVLYIiKKNKOTjLAPGYTET^ 

CYYQGHILNEKVSDASISTCRGLRGYFSQGDQRYFIEPLSPIHIUDGQEHAIjFKyNPDEK 

NYDSTCGMDGVLWAHDLQQNIALPATKLVK^^ 

NQDEIRKRVFEMANYVNMLYKKLNTHVALV 

SVLSRRKRHD IAQLITATEIiAGTTVGLAFMSTMCS PYS VGVVQDHSDNLIiRVAGTMAHE 
MGHNFGMFHDDYSCKCPSTICVMDKALSFYIPTDFSSCSRLSYDKFFEDKLSNCIjFNAP 
IjPTDIISTPICGNQLVEMGEDCTCGTSEECTNICCH^AKTCKIKATFQCAIjGECCEKCQF 
KKAGMVCRPAKDEOTLPEMCNGKSGNCPDDRFQWGFPCHHGKGHCIiMGTCPTLRE 
ELWGPGRRTNPFPCACAKENHFR" 

Sequence 2087 BP; 657 A; 376 C; 478 G; 576 T; 0 other;
gcgagaagag cagacaccgt gctcctggaa tcacccagca tgttgcaagg tctcctgcca 
gtcagtctcc tcctctctgt tgcagtaagt gctataaaag aactccctgg ggtgaagaag 
tatgaagtgg tttatcctat aagacttcat ccactgcata aaagagaggc caaagagcca 
gagcaacagg aacaatttga aactgaatta aagtataaaa tgacaattaa tggaaaaatt 
gcagtgcttt atttgaaaaa aaacaagaac ctccttgcac caggctacac ggaaacatat 
tataattcca ctggaaagga gatcaccaca agcccacaaa ttatggatga ttgttattat 
caaggacata ttcttaatga aaaggtttct gacgctagca tcagcacatg taggggtcta 
aggggctact tcagtcaggg ggatcaaaga tactttattg aacctttaag ccccatacat 
cgggatggac aggagcatgc actcttcaag tataaccctg atgaaaagaa ttatgacagc 
acctgtggga tggatggtgt gttgtgggcc cacgatttgc agcagaacat tgccctacct 
gccaccaaac tagtaaaatt gaaagacagg aaggttcagg aacatgagaa atacatagaa 
tattatctgg tcctggataa tggtgagttt aaaaggtaca atgagaatca agatgagatc 
agaaagaggg tatttgagat ggctaattat gtcaacatgc tttataaaaa gctcaatact 
catgtggcct tagttggtat ggaaatctgg actgacaagg ataagataaa gataacccca 
aatgcaagct tcaccttgga gaatttttct aaatggaggg ggagtgttct ctcaagaaga 
aagcgtcatg atattgctca gttaatcaca gcaacagaac ttgctggaac gactgtgggt 
cttgcattta tgtctacaat gtgttctcct tattctgttg gcgttgttca ggaccacagc 
gataatcttc ttagagttgc agggacaatg gcacatgaaa tgggccacaa ctttggaatg 
tttcatgacg actattcttg caagtgtcct tctacaatat gtgtgatgga caaagcactg 
agcttctata tacccacaga cttcagttcc tgcagccgtc tcagctatga caagtttttt 
gaagataaat tatcaaattg cctctttaat gctccattgc ctacagatat catatccact 
ccaatttgtg ggaaccagtt ggtggaaatg ggagaggact gtgattgtgg gacatctgag 
gaatgtacca atatttgctg tgatgctaag acatgtaaaa tcaaagcaac ttttcaatgt 
gcattaggag aatgttgtga aaaatgccaa tttaaaaagg ctgggatggt gtgcagacca 
gcaaaagatg agtgcgacct gcctgaaatg tgtaatggta aatctggtaa ttgtcctgat 
gatagattcc aagtcaatgg cttcccttgc catcacggga agggccactg cttgatgggc 
acatgcccca cactgcggga gcagtgcaca gagctgtggg gaccaggtag gaggacaaat 
cctttcccct gtgcatgtgc gaaggaaaat catttcagat gacagtgttt aaccatggtc 
aaaagaccat tctgtcctat ccttcttaga agcttcgaac tcaaaatcat ggaaaggttt 
taagatttga ggttggtttt agggttgcta gatttagcaa gtaaaaataa ggatggcccc 
gttaaatttt aacttaaaat taacaagttt tttgttaatt ttttgttttt tgtctcagca 
tcagtatatc ccatgcaata tttgaggtgt gctcatacta aaattatttg tgtatctgaa 
attcaaatta aactgggtgt ctttttcttt tcatctggca accctactaa gatcataaac 
ccttggaaat ctgtgtgtgt gcgggtgtgt gtgtgtgtgt gtgtgcaggg gtggcagaag 
tactgtggga tgggacagaa ataaaaaaaa aaaaaaaaaa aaaaaaa 
Homo sapiens matrix metalloproteinase 7 (matrilysin, uterine), mRNA (cDNA
clone MGC:3913 IMAGE:3545760), complete cds.

/translation="MRLTVLCAVCLLPGSLALPLPQEAGGMSELQWEQAQDYLKRFYLY
DSETKNANSLEAKLKEMQKFFGLPITGMLNSHVIEIMQKPRCGVPDVAEYSLFPNSPKW
TSKVVTYRIVSYTRDLPHITVDRLVSKALNMWGKEIPLHFRKVVWGTADIMIGFARGAH
GDSYPFDGPGNTLAHAFAPGTGLGGDAHFDEDERWTDGSSLGINFLYAATHELGHSLGM
GHSSDPNAVMYPTYGNGDPQNFKLSQDDIKGIQKLYGKRSNSRKK"

gtccaagaac aattgtctct ggacggcagc tatgcgactc accgtgctgt gtgctgtgtg 60
cctgctgcct ggcagcctgg ccctgccgct gcctcaggag gcgggaggca tgagtgagct 120
acagtgggaa caggctcagg actatctcaa gagattttat ctctatgact cagaaacaaa 180
aaatgccaac agtttagaag ccaaactcaa ggagatgcaa aaattctttg gcctacctat 240
aactggaatg ttaaactccc acgtcataga aataatgcag aagcccagat gtggagtgcc 300
agatgttgca gaatactcac tatttccaaa tagcccaaaa tggacttcca aagtggtcac 360
ctacaggatc gtatcatata ctcgagactt accgcatatt acagtggatc gattagtgtc 420
aaaggcttta aacatgtggg gcaaagagat ccccctgcat ttcaggaaag ttgtatgggg 480
aactgctgac atcatgattg gctttgcgcg aggagctcat ggggactcct acccatttga 540
tgggccagga aacacgctgg ctcatgcctt tgcgcctggg acaggtctcg gaggagatgc 600
tcacttcgat gaggatgaac gctggacgga tggtagcagt ctagggatta acttcctgta 660
tgctgcaact catgaacttg gccattcttt gggtatggga cattcctctg atcctaatgc 720
agtgatgtat ccaacctatg gaaatggaga tccccaaaat tttaaacttt cccaggatga 780
tattaaaggc attcagaaac tatatggaaa gagaagtaat tcaagaaaga aatagaaact 840
tcaggcagaa catccattca ttcattcatt ggattgtata tcattgttgc acaatcagaa 900
ttgataagca ctgttcttcc actccattta gcaattatgt cacccttttt tattgcagtt 960
ggtttttgaa tgtctttcac tccttttaag gataaactcc tttatggtgt gactgtgtct 1020
tattcatcta tacttgcagt gggtagatgt caataaatgt tacatacaca aataaataaa 1080
atgtttattc catggtaaat ttaaaaaaaa aaaaaaaaaa aaaaaaaaaa

Homo sapiens cDNA FLJ10650 fis, clone NT2RP2005853.
fis (full insert sequence); oligo capping.

/ trans 1 a t i on= n MGLSHSKTHIiRVT KVAPIiQNKEVETP S AGRVDF AFNQNLEEKTS Y 
SLARI1QDQNKAI1EGQLPPLQENWYGRYSTASRDMYFDI PliEHRETS I IKRHPPQRJjQKL 
E P IDLPRVT T S GRLIj S QREARTMHKAKQ VLEKKMQTPMYT S ENRQ YIjHKMQVIjEM I RKR 
QEAQMELKKSIfiGEARIl^QSPRDHK^^ 

DTEFIiKHQAVNNCCPWKIGKMETWIjHEQEAQGQIjLWDS S S SD SDEQGKDEKKPRALVRT 
RTERIPLFDEFFDQE n 

Sequence 2505 BP; 851 A; 510 C; 522 G; 622 T; 0 other; 
tagaaggatg ccatgaagga aatgactgct ttgtaaagcg agggtaaact tctgaaatgc 60
tttgattaaa ataagctata ttaaagaccc caaaaccact tccctcgcag ctttcctctg 120
aatgtctttc acatgaaatg ggcctgagtc actctaagac tcaccttagg gtgatcaaag 180
tagcaccttt gcaaaacaaa gaggtagaga ctccctcggc tggccgtgtg gactttgcat 240
tcaatcagaa tttggaagaa aagacttcat attcactggc aagactgcag gaccagaata 300
aagccttgga agggcagctg ccacctttac aagaaaactg gtatggaaga tattctacag 360
catccagaga catgtatttt gacatcccac tggaacacag agaaacaagt attattaaaa 420
ggcatccacc ccaaagactt caaaagcttg aacccattga cttgccacga gtaattactt 480
caggaagact cctgagccag cgagaagcca ggacaatgca caaagcaaag caggtactag 540
aaaagaaaat gcaaactcca atgtatactt ctgagaacag acaatatttg cataagatgc 600
aagtgctgga aatgatccgt aaaagacaag aggcccaaat ggagttaaag aaaagtcttc 660
atggagaggc aagaattaat aagcaaagtc caagggacca taaagccaag aaaacccttc 720
aaagcacccc aaggaatgat gaccatgacc ttctaaccat gttgcctgat gaaatcttga 780
acagaggtcc cggaaattca aaggatacag aatttttgaa acatcaagca gtgaataact 840
gctgtccctg gaaaattggc aaaatggaaa catggcttca tgaacaagag gcccagggac 900
agcttctctg ggacagttcc agctctgact cagatgagca ggggaaagat gagaagaagc 960
cacgagcact ggtgaggacc aggacagaga gaatcccact tttcgatgag ttttttgatc 1020
aagaataaga atactattca ttaacctaga aactgagtgc tttgaaagct tgttttactc 1080
tcaaaatctt ccaaactgat atatgaatta ctttgaggac agcaaatcac tttggtaaaa 1140
agaaatgata tctttagagt cttatgatta acaagtccgt cacatgtgct gttaactatt 1200
gctgcatcac taaatgcctc aaaacacagg ggctaaccaa gagccatttt attgtctcac 1260
tttcctgtgg gttgctgagt tcagctaggt ggttcttctg ctgttccctt ttgaaatctt 1320
tcatgtcatt gaaattggac tacatttgga ctactaagac tggagtcatc cagaggcccc 1380
attaggacac tgacacaggg ggaccttgcc ccctatccat gtagcctcag agcatctgca 1440
tatgatctct tcagcaggac actgcagaga tttctctcat atgagcccag ggctcccaaa 1500
agcaattgtc tcaagaggag gaaacagaag ttgccagtct tcttaaaagc tagtcataga 1560
actgactttg catcacttcc accatgttct gttggttaaa gtcatagacc agcttggagt 1620
taatgcagaa gaggactaca caagggcatg agtgccaaag tgtggtcctt tggggttcat 1680
ctttgacatt tgccacctca ccatgttttt aaaaagaaaa ttagattaca taaaacaaat 1740
agatgggctg gatgtggtgg ctcacacctg taatcccagc actttgggag gccgaggtgg 1800
gcagatcact tgaggtcagg agttcaagac cagcctggcc aacatggtga gacaccgtct 1860
ctactaaaat acaaaaatta gccagacatg gtggcattcg tctgtaatcc cagctacttg 1920
atgcgaaggc tgaggcaaaa gaattgcttg aacccaggag atagaggttg caatgagcca 1980
agatcactcc actgcactcc agcctgggcg acagaatgag actctgtctc aaaattaaaa 2040
aacaaaaaac caaaaacaaa tagatgaaaa agtagactgg agacaaataa aagtgagttt 2100
ctaaaggaaa ttcacagtaa tgctgcatta aacactaagc tcacttaggt cactttctag 2160
tgagctaacc gtaacagaga gcctacagga tacacgtgag ataatgtcac gtgtagaaga 2220
tcgttgtgaa ttaaagttca aaattaagac ttcttagatt atgatgtaga ttttagagct 2280
ccttaaaaca taaagcgaat cttataaatg ttcaattcta aagttattcc acttggaaaa 2340
attagctttt gggacaattt ttaagaactt ttgtgtaaat gcagctccat gtttagcata 2400
atctaaaaat aatttcaagc aatccagaat cttccaagaa tttattaaag ctttaaaaca 2460
aagcaaaaca aaaagaccct tttgtgcctt atatgggaag actcc 2505






Homo sapiens transcription factor ISGF-3 mRNA, complete cds. 



transcription factor. 



/translation="MSQWYELQQLDSKFLEQVHQLYDDSFPMEIRQYLAQWLEKQDWEH
AANDVSFATIRFHDLLSQLDDQYSRFSLENNFLLQHNIRKSKRNLQDNFQEDPIQMSMI
IYSCLKEERKILENAQRFNQAQSGNIQSTVMLDKQKELDSKVRNVKDKVMCIEHEIKSL
EDLQDEYDFKCKTLQNREHETNGVAKSDQKQEQLLLKKMYLMLDNKRKEVVHKIIELLN
VTELTQNALINDELVEWKRRQQSACIGGPPNACLDQLQNWFTIVAESLQQVRQQLKKLE
ELEQKYTYEHDPITKNKQVLWDRTFSLFQQLIQSSFVVERQPCMPTHPQRPLVLKTGVQ
FTVKLRLLVKLQELNYNLKVKVLFDKDVNERNTVKGFRKFNILGTHTKVMNMEESTNGS
LAAEFRHLQLKEQKNAGTRTNEGPLIVTEELHSLSFETQLCQPGLVIDLETTSLPVVVI
SNVSQLPSGWASILWYNMLVAEPRNLSFFLTPPCARWAQLSEVLSWQFSSVTKRGLNVD
QLNMLGEKLLGPNASPDGLIPWTRFCKENINDKNFPFWLWIESILELIKKHLLPLWNDG
CIMGFISKERERALLKDQQPGTFLLRFSESSREGAITFTWVERSQNGGEPDFHAVEPYT
KKELSAVTFPDIIRNYKVMAAENIPENPLKYLYPNIDKDHAFGKYYSRPKEAPEPMELD
GPKGTGYIKTELISVSEVHPSRLQTTDNLLPMSPEEFDEVSRIVGSVEFDSMMNTV"

Sequence 4003 BP; 1173 A; 812 C; 883 G; 1135 T; 0 other;

attaaacctc tcgccgagcc cctccgcaga ctctgcgccg gaaagtttca tttgctgtat 60 

gccatcctcg agagctgtct aggttaacgt tcgcactctg tgtatataac ctcgacagtc 120 

ttggcaccta acgtgctgtg cgtagctgct cctttggttg aatccccagg cccttgttgg 180 

ggcacaaggt ggcaggatgt ctcagtggta cgaacttcag cagcttgact caaaattcct 240 

ggagcaggtt caccagcttt atgatgacag ttttcccatg gaaatcagac agtacctggc 300

acagtggtta gaaaagcaag actgggagca cgctgccaat gatgtttcat ttgccaccat 360 

ccgttttcat gacctcctgt cacagctgga tgatcaatat agtcgctttt ctttggagaa 420 

taacttcttg ctacagcata acataaggaa aagcaagcgt aatcttcagg ataattttca 480 



tatagagcat gaaatcaaga gcctggaaga tttacaagat gaatatgact tcaaatgcaa 720 

aaccttgcag aacagagaac acgagaccaa tggtgtggca aagagtgatc agaaacaaga 780 

acagctgtta ctcaagaaga tgtatttaat gcttgacaat aagagaaagg aagtagttca 840 

caaaataata gagttgctga atgtcactga acttacccag aatgccctga ttaatgatga 900 

actagtggag tggaagcgga gacagcagag cgcctgtatt ggggggccgc ccaatgcttg 960 

cttggatcag ctgcagaact ggttcactat agttgcggag agtctgcagc aagttcggca 1020 

gcagcttaaa aagttggagg aattggaaca gaaatacacc tacgaacatg accctatcac 1080 

aaaaaacaaa caagtgttat gggaccgcac cttcagtctt ttccagcagc tcattcagag 1140 

ctcgtttgtg gtggaaagac agccctgcat gccaacgcac cctcagaggc cgctggtctt 1200 

gaagacaggg gtccagttca ctgtgaagtt gagactgttg gtgaaattgc aagagctgaa 1260 

ttataatttg aaagtcaaag tcttatttga taaagatgtg aatgagagaa atacagtaaa 1320 

aggatttagg aagttcaaca ttttgggcac gcacacaaaa gtgatgaaca tggaggagtc 1380

caccaatggc agtctggcgg ctgaatttcg gcacctgcaa ttgaaagaac agaaaaatgc 1440

tggcaccaga acgaatgagg gtcctctcat cgttactgaa gagcttcact cccttagttt 1500 

tgaaacccaa ttgtgccagc ctggtttggt aattgacctc gagacgacct ctctgcccgt 1560 

tgtggtgatc tccaacgtca gccagctccc gagcggttgg gcctccatcc tttggtacaa 1620 

catgctggtg gcggaaccca ggaatctgtc cttcttcctg actccaccat gtgcacgatg 1680 

ggctcagctt tcagaagtgc tgagttggca gttttcttct gtcaccaaaa gaggtctcaa 1740 

tgtggaccag ctgaacatgt tgggagagaa gcttcttggt cctaacgcca gccccgatgg 1800 

tctcattccg tggacgaggt tttgtaagga aaatataaat gataaaaatt ttcccttctg 1860 

gctttggatt gaaagcatcc tagaactcat taaaaaacac ctgctccctc tctggaatga 1920 

tgggtgcatc atgggcttca tcagcaagga gcgagagcgt gccctgttga aggaccagca 1980 

gccggggacc ttcctgctgc ggttcagtga gagctcccgg gaaggggcca tcacattcac 2040 

atgggtggag cggtcccaga acggaggcga acctgacttc catgcggttg aaccctacac 2100 

gaagaaagaa ctttctgctg ttactttccc tgacatcatt cgcaattaca aagtcatggc 2160 

tgctgagaat attcctgaga atcccctgaa gtatctgtat ccaaatattg acaaagacca 2220 

tgcctttgga aagtattact ccaggccaaa ggaagcacca gagccaatgg aacttgatgg 2280 



ccctaaagga actggatata tcaagactga gttgatttct gtgtctgaag ttcacccttc 2340
tagacttcag accacagaca acctgctccc catgtctcct gaggagtttg acgaggtgtc 2400
tcggatagtg ggctctgtag aattcgacag tatgatgaac acagtataga gcatgaattt 2460
ttttcatctt ctctggcgac agttttcctt ctcatctgtg attccctcct gctactctgt 2520
tccttcacat cctgtgtttc tagggaaatg aaagaaaggc cagcaaattc gctgcaacct 2580
gttgatagca agtgaatttt tctctaactc agaaacatca gttactctga agggcatcat 2640
gcatcttact gaaggtaaaa ttgaaaggca ttctctgaag agtgggtttc acaagtgaaa 2700
aacatccaga tacacccaaa gtatcaggac gagaatgagg gtcctttggg aaaggagaag 2760
ttaagcaaca tctagcaaat gttatgcata aagtcagtgc ccaactgtta taggttgttg 2820
gataaatcag tggttattta gggaactgct tgacgtagga acggtaaatt tctgtgggag 2880
aattcttaca tgttttcttt gctttaagtg taactggcag ttttccattg gtttacctgt 2940
gaaatagttc aaagccaagt ttatatacaa ttatatcagt cctctttcaa aggtagccat 3000
catggatctg gtagggggaa aatgtgtatt ttattacatc tttcacattg gctatttaaa 3060
gacaaagaca aattctgttt cttgagaaga gaatattagc tttactgttt gttatggctt 3120
aatgacacta gctaatatca atagaaggat gtacatttcc aaattcacaa gttgtgtttg 3180
atatccaaag ctgaatacat tctgctttca tcttggtcac atacaattat ttttacagtt 3240
ctcccaaggg agttaggcta ttcacaacca ctcattcaaa agttgaaatt aaccatagat 3300
gtagataaac tcagaaattt aattcatgtt tcttaaatgg gctactttgt cctttttgtt 3360
attagggtgg tatttagtct attagccaca aaattgggaa aggagtagaa aaagcagtaa 3420
ctgacaactt gaataataca ccagagataa tatgagaatc agatcatttc aaaactcatt 3480
tcctatgtaa ctgcattgag aactgcatat gtttcgctga tatatgtgtt tttcacattt 3540
gcgaatggtt ccattctctc tcctgtactt tttccagaca cttttttgag tggatgatgt 3600
ttcgtgaagt atactgtatt tttacctttt tccttcctta tcactgacac aaaaagtaga 3660
ttaagagatg ggtttgacaa ggttcttccc ttttacatac tgctgtctat gtggctgtat 3720
cttgtttttc cactactgct accacaacta tattatcatg caaatgctgt attcttcttt 3780
ggtggagata aagatttctt gagttttgtt ttaaaattaa agctaaagta tctgtattgc 3840
attaaatata atatcgacac agtgctttcc gtggcactgc atacaatctg aggcctcctc 3900
tctcagtttt tatatagatg gcgagaacct aagtttcagt tgattttaca attgaaatga 3960
ctaaaaaaca aagaagacaa cattaaaaac aatattgttt cta 4003






Homo sapiens RNA helicase (RIG-I) mRNA, complete cds 



' T /translation= "MTTEQRRSLQAFQDYIRKTLDPTYILSYMAPWFREEEVQYIQAEK 

T ^KGPMEAATLFLKFLLELQEEGWFRGFLDW^HAGYSGLYEAIESWDPKKIEKLEEYR 

LLLKRIiQPEFKTRI I PTDI ISDLSECIiINQECEE ILQICSTKGMMAGAEKLVECLLRSD 
KENWPKTLKLALEKERNKFSELWIVEKGIKDVETEDLEDKMETSDIQIFYQEDPECQNL 
SENSCPPSEVSDTOTjYSPFKPRNYQLELALPAMKGKNTIICAPTGCGKTFVSLLICEHH 
LKKFPQGQKGKWFF ANQI PVYEQQKS VFS KYFERHG YRVTGI SGATAENVPVEQ I VEN 

^^^^^^^^^^^^^^^^^TIPSLSIFTI^IFDECHNTSKQHPYIJMIMFTIYIjDQKLGGSS 
GPLPQVIGIiTASVGVGDAKNTDEALDYICKLCy^IiDASVIATVKHNliEELEQVVYKPQK 
FFRKA^SRISDKFKYIIAQLMRDTESLAKRICKDLENIiSQIQNREFGTQKYEQWIVTVQ 

kacmvfqmpdkdeesrickalflytshlrkyndaliiseharmkdaldylkdffsnvra 
agfeeieqdltqrfeeklqelesvsrdpsnempkledlcfilqeeyhlnpetitilfvk 
tpj^vdalknwiegnpklsflkpgiltgrgktnqntgmtlpaqkcildafkasgdhnil 

IATSVADEGIDIAQCNLVILYEYVGNVIKMIQTRGRGRARGSKCFIJjTSNAGVIEKEQI 
NOT^KMNBTOSILRLQTWDEA 

CKAIACYTADVRVIEECHYTVI^AFKECFVSRPHPKPKQFSSFEKRAKIFCARQNCSH 
DWGIHVKYKTFEIPVIKIESFVVEDIATGVQTLYSKWKDFHFEKIPFDPAEMSK" 




Sequence 3065 BP; 1028 A; 592 C; 669 G; 776 T; 0 other;

tagttattaa agttcctatg cagctccgcc tccgtccggc ctcatttcct caaaaaatcc 60
ctgctttccc cgctcgccac gccctcctgc tacccggctt taaagctagt gaggcacagc 120 
ctgcggggaa cgtagctagc tgcaagcaga ggccggcatg accaccgagc agcgacgcag 180 
cctgcaagcc ttccaggatt atatccggaa gaccctggac cctacctaca tcctgagcta 240 
catggccccc tggtttaggg aggaagaggt gcagtatatt caggctgaga aaaacaacaa 300 
gggcccaatg gaggctgcca cactttttct caagttcctg ttggagctcc aggaggaagg 360
ctggttccgt ggctttttgg atgccctaga ccatgcaggt tattctggac tttatgaagc 420 
cattgaaagt tgggatttca aaaaaattga aaagttggag gagtatagat tacttttaaa 480
acgtttacaa ccagaattta aaaccagaat tatcccaacc gatatcattt ctgatctgtc 540
tgaatgttta attaatcagg aatgtgaaga aattctacag atttgctcta ctaaggggat 600
gatggcaggt gcagagaaat tggtggaatg ccttctcaga tcagacaagg aaaactggcc 660
caaaactttg aaacttgctt tggagaaaga aaggaacaag ttcagtgaac tgtggattgt 720
agagaaaggt ataaaagatg ttgaaacaga agatcttgag gataagatgg aaacttctga 780
catacagatt ttctaccaag aagatccaga atgccagaat cttagtgaga attcatgtcc 840
accttcagaa gtgtctgata caaacttgta cagcccattt aaaccaagaa attaccaatt 900
agagcttgct ttgcctgcta tgaaaggaaa aaacacaata atatgtgctc ctacaggttg 960
tggaaaaacc tttgtttcac tgcttatatg tgaacatcat cttaaaaaat tcccacaagg 1020
acaaaagggg aaagttgtct tttttgcgaa tcagatccca gtgtatgaac agcagaaatc 1080
tgtattctca aaatactttg aaagacatgg gtatagagtt acaggcattt ctggagcaac 1140
agctgagaat gtcccagtgg aacagattgt tgagaacaat gacatcatca ttttaactcc 1200
acagattctt gtgaacaacc ttaaaaaggg aacgattcca tcactatcca tctttacttt 1260
gatgatattt gatgaatgcc acaacactag taaacaacac ccgtacaata tgatcatgtt 1320
taattatcta gatcagaaac ttggaggatc ttcaggccca ctgccccagg tcattgggct 1380
gactgcctcg gttggtgttg gggatgccaa aaacacagat gaagccttgg attatatctg 1440
caagctgtgt gcttctcttg atgcgtcagt gatagcaaca gtcaaacaca atctggagga 1500
actggagcaa gttgtttata agccccagaa gtttttcagg aaagtggaat cacggattag 1560
cgacaaattt aaatacatca tagctcagct gatgagggac acagagagtc tggcaaagag 1620
aatctgcaaa gacctcgaaa acttatctca aattcaaaat agggaatttg gaacacagaa 1680
atatgaacaa tggattgtta cagttcagaa agcatgcatg gtgttccaga tgccagacaa 1740
agatgaagag agcaggattt gtaaagccct gtttttatac acttcacatt tgcggaaata 1800
taatgatgcc ctcattatca gtgagcatgc acgaatgaaa gatgctctgg attacttgaa 1860
agacttcttc agcaatgtcc gagcagcagg attcgaagag attgagcaag atcttactca 1920
gagatttgaa gaaaagctgc aggaactaga aagtgtttcc agggatccca gcaatgagaa 1980
tcctaaactt gaagacctct gcttcatctt acaagaagag taccacttaa acccagagac 2040
aataacaatt ctctttgtga aaaccagagc acttgtggac gctttaaaaa attggattga 2100
aggaaatcct aaactcagtt ttctaaaacc tggcatattg actggacgtg gcaaaacaaa 2160
tcagaacaca ggaatgaccc tcccggcaca gaagtgtata ttggatgcat tcaaagccag 2220



taaaaatcac aatattctga ttgccacctc agttgctgat gaaggcattg acattgcaca 2280
oScaatctt gtcatcct?t atlagtatgt gggcaatgtc atcaaaatga tccaaaccag 2340
aglcagagga agagcaagag gtagcaagtg cttccttctg actagtaatg ctgg^gtaat 2400
SLaaagaa caaataaaca tgtacaaaga aaaaatgatg aatgactcta ttttacgcct 2460
tcaSca?gg gacgaagcag tatttaggga aaagattctg catatacaga ctcatgaaaa 2520
attcatcaS latagtcaal aaaaaccaaa acctgtccct gataaggaaa ataaaaaact 2580
SctctgcSa aagtlcaaag ccttggcatg ttacacagct gacgtaagag tgatagagga 2640
atgccatSc acigtgcttg gagatgcttt taaggaatgc tttgtgagta gaccacatcc 2700
caaaccaaag cagttttcaa gttttgaaaa aagagcaaag atattctgtg cccgacagaa 2760
ctacagcca? gactggggaa tccatgtgaa gtacaagaca tttgagattc cagttataaa 2820
aa?t2aagt ?ttgtgg?gg aggatattgc aactggagtt cagacactgt actcgaagtg 2880
aaagScSt cat?tSaS aStaccatt tgatccagca gaaatgtcca aatgatatca 2940
SSSSSJ cttcagctac agggaatgag taactttgag tggagaagaa -aaacatag 3000
igggtataat catggatcgc ttgtacccct gtgaaaatat attttttaaa aataaaaaaa 3060
aaaaa 3065



DE Homo sapiens melanoma differentiation associated protein-5 (MDA5) mRNA,
DE complete cds.



FT /translation="MSNGYSTDENFRYLISCFRARVKMYIQVEPVLDYLTFLPAEVKEQ
FT IQRTVATSGNMQAVELLLSTLEKGVWHLGWTREFVEALRRTGSPLAARYMNPELTDLPS

PSFENAHDEYTjQLLNLLQPTLVDKLLVRDVIjDKC^ 
VRELLKRIVQKENWFSAFLNVLRQTGNNELVQELTGSDCSESN^ 

QLLSTTVQPNLEKEVWGMENNSSESSFADSSVVSESDTSLAEGSVSCLDESLGHNSNMG
SDSGTMGSDSDEENVAARASPEPELQLRPYQMEVAQPALEGKNIIICLPTGSGKTRVAV
YIAIODHIiDKKXKASEPGKVIVLV^ 
?T FPEVVKSCDIIISTAQILENSIiLNLENGEDAGVQLSDFS 

?T RHYLMQKLKNNRLKKENKP VI PLPQ ILGLTAS PGVGGATKQAKAEEHI LKLCANLDAFT 

?T I KTVKENIjDQLKNQIQEPCKKFAIADATREDPFKEKLIjE imtri qtycqms pmsdfgtq 

?T PYEQWAIQMEKXAAKKGNRKERVCAEHLRKYNEALQINDTI^ 
?T KKFAVIEDDSDEGGDDEYCDGDEDEDDLKKPLKLDETDRFLMTLFFENNKMLKR^ 

EYENEKLTKLRNTIMEQYTRTEESARGIIFTKTRQSAYALSQWITENEKFAEVGVKAHH 
LIGAGHSSEFKPMTQNEQKEVISKFRTGKINLLIATTVAEEGLDIKECNIVTRYGLVTN
EIAMVQARGRARADESTYVLVAHSGSGVIEHETVNDFREKMMYKAIHCVQNMKPEEYAH
■ T KILE LQMQSIMEKKMKTK30JIAKHYK1^SL^ 

?T OTPEFKELYIVRENKALQKKCADYQINGEIICKCGQAWGl^^VHKGLDLPCLKIRNFVV 

- T VFKNNSTKKQYKKWVELPITFPNLDYSECCIiFSDED n 

XX

SQ Sequence 3380 BP; 1153 A; 644 C; 753 G; 830 T; 0 other;

gcgcgccggc ctgagagccc tgtggacaac ctcgtcattg tcaggcacag agcggtagac 60 
cctgcttctc taagtgggca gcggacagcg gcacgcacat ttcacctgtc ccgcagacaa 120 
cagcaccatc tgcttgggag aaccctctcc cttctctgag aaagaaagat gtcgaatggg 180 
tattccacag acgagaattt ccgctatctc atctcgtgct tcagggccag ggtgaaaatg 240 
tacatccagg tggagcctgt gctggactac ctgacctttc tgcctgcaga ggtgaaggag 300 
cagattcaga ggacagtcgc cacctccggg aacatgcagg cagttgaact gctgctgagc 360 
accttggaga agggagtctg gcaccttggt tggactcggg aattcgtgga ggccctccgg 420 
agaaccggca gccctctggc cgcccgctac atgaaccctg agctcacgga cttgccctct 480 
ccatcgtttg agaacgctca tgatgaatat ctccaactgc tgaacctcct tcagcccact 540 
ctggtggaca agcttctagt tagagacgtc ttggataagt gcatggagga ggaactgttg 600 
acaattgaag acagaaaccg gattgctgct gcagaaaaca atggaaatga atcaggtgta 660 
agagagctac taaaaaggat tgtgcagaaa gaaaactggt tctctgcatt tctgaatgtt 720 
cttcgtcaaa caggaaacaa tgaacttgtc caagagttaa caggctctga ttgctcagaa 780 
agcaatgcag agattgagaa tttatcacaa gttgatggtc ctcaagtgga agagcaactt 840 
ctttcaacca cagttcagcc aaatctggag aaggaggtct ggggcatgga gaataactca 900 
tcagaatcat cttttgcaga ttcttctgta gtttcagaat cagacacaag tttggcagaa 960 
ggaagtgtca gctgcttaga tgaaagtctt ggacataaca gcaacatggg cagtgattca 1020 
ggcaccatgg gaagtgattc agatgaagag aatgtggcag caagagcatc cccggagcca 1080 
gaactccagc tcaggcctta ccaaatggaa gttgcccagc cagccttgga agggaagaat 1140 
atcatcatct gcctccctac agggagtgga aaaaccagag tggctgttta cattgccaag 1200 
gatcacttag acaagaagaa aaaagcatct gagcctggaa aagttatagt tcttgtcaat 1260 
aaggtactgc tagttgaaca gctcttccgc aaggagttcc aaccattttt gaagaaatgg 1320 
tatcgtgtta ttggattaag tggtgatacc caactgaaaa tatcatttcc agaagttgtc 1380 
aagtcctgtg atattattat cagtacagct caaatccttg aaaactccct cttaaacttg 1440 
gaaaatggag aagatgctgg tgttcaattg tcagactttt ccctcattat cattgatgaa 1500 
tgtcatcaca ccaacaaaga agcagtgtat aataacatca tgaggcatta tttgatgcag 1560 
aagttgaaaa acaatagact caagaaagaa aacaaaccag tgattcccct tcctcagata 1620 
ctgggactaa cagcttcacc tggtgttgga ggggccacga agcaagccaa agctgaagaa 1680 
cacattttaa aactatgtgc caatcttgat gcatttacta ttaaaactgt taaagaaaac 1740 
cttgatcaac tgaaaaacca aatacaggag ccatgcaaga agtttgccat tgcagatgca 1800 
accagagaag atccatttaa agagaaactt ctagaaataa tgacaaggat tcaaacttat 1860 
tgtcaaatga gtccaatgtc agattttgga actcaaccct atgaacaatg ggccattcaa 1920 
atggaaaaaa aagctgcaaa aaaaggaaat cgcaaagaac gtgtttgtgc agaacatttg 1980 
aggaagtaca atgaggccct acaaattaat gacacaattc gaatgataga tgcgtatact 2040 



catcttgaaa ctttctataa tgaagagaaa gataagaagt ttgcagtcat agaagatgat 2100
aatqatgagg gtggtgatga tgagtattgt gatggtgatg aagatgagga tgatttaaag 2160
aaacct?tga aactggatia aacagataga tttctcatga ctttattttt tgaaaacaat 2220
aaaatqttga aaaggctggc tgaaaaccca gaatatgaaa atgaaaagct gaccaaatta 2280
agaaaScX taa?ggag1a alatactagg actgaggaat cagcacgagg ^ataatcttt 2340
aSaaaacac gaca^agtgc atatgcgctt tcccagtgga "actgaaaa tgaaaaattt 2400
actgaagtag gagtcaaagc ccaccatctg attggagctg gacacagcag tgagttcaaa 2460
IccSgacac agaatgaaca aaaagaagtc attagtaaat ttcgcactgg aaaaatcaat 2520
ctQcttatcg ctaccacagt ggcagaagaa ggtctggata ttaaagaatg taacattgtt 2580
tESSESi gtctcgtclc Xatgaaata gccatggtcc aggcccgtgg tcgagccaga 2640
gctgEgaga gcacclacgt cctggttgct cacagtggtt caggagttat cgaacatgag 2700
Scagttaalg atttccgaga gaagatgatg tataaagcta tacattgtgt tcaaaatatg 2760
aaaccagagg agtatgctca taagattttg gaattacaga tgcaaagtat jatggaaaag 2820
aaaatcjaaaa ccaagagaaa tattgccaag cattacaaga ataacccatc actaataact 2880
^SKgca aaaactgcag tgtgctagcc tgttctgggg -gatatcca tgtaattgag 2940
aaaatgcatc acgtcaatat gaccccagaa ttcaaggaac tttacattgt aagagaaaac 3000
aaaqcactgc aaaagaagtg tgccgactat caaataaatg gtgaaatcat ctgcaaatgt 3060
ggcSggSt ggggaacla? gatggtgcac aaaggcttag atttgccttg tctcaaaata 3120
ISaattttg Sgtggtttt caaaaataat tcaacaaaga aacaatacaa aaagtgggta 3180
SKtaccta tcacSttcc caatcttgac tattcagaat gctgtttatt ^agtgatgag 3240
gattagcact tgattgaaga ttcttttaaa atactatcag ttaaacattt aatatgatta 3300
tgattaatgt a?tca?tatg ctacagaact gacataagaa tcaataaaat gattgtttta 3360
ctctgaaaaa aaaaaaaaaa 3380



DE Homo sapiens signal transducer and activator of transcription 1, 91kDa
DE transcript variant beta, mRNA (cDNA clone MGC:3493 IMAGE:3627218), complete
DE cds.



FT /translation="MSQWYELQQLDSKFLEQVHQLYDDSFPMEIRQYLAQWLEKQDWEH

r AANDVSFATIRFHDIiIiSQLDDQYSRFSLENNPLIjQHNIRKSKRNLQDNFQEDPIQMSMI 

IYSCLKEERKIIiENAQRFNQAQSGWIQSTVMLDKQKELDSKVRNVKDKVMCIEHEIKSIi 
EDLQDEYDFKCKTLQmEHETNGVAKSDQKQEQLLIiKKMYLMLDNKRKEWHKIIELlJJ 
VTELTQNALINDELVEWKRRQQSACIGGPPNACLDQLQNWFTIVAESLQQVRQQLKKLE 
ELEQKyTYEHDPITKNKQVl^WDRTFSLFQQLIQSSFVVERQPCMPTHPQRPLVLKTGVQ 
FWKI^LLVKLQELNYNLKvKVLFDKDVNERNTVKGFRKFNIl^ra 
IiAAEPRHLQLKEQKNAGTRTNEGPLIVTEELHSLSFETQLCQPGLVIDLETTSLPVVVI 
SNVSQLPSGWASILWYNMLVAEPRNLSFFLTPPCARWAQLSEVLSWQFSSvTKRGLNVD 
QLNMLGEKLIjGPNASPDGLI PWTRFCKENINDKtJFP F WLW IES I LEIi IKKHLLPLWNDG 

CIMGFISKERERALLKDQQPGTFLLRFSESSREGAITFTWVERSQNGGEPDFHAVEPYT 
KKELSAVTFPDIIRNYKVMAAENIPENPLKYLYPNIDKDHAFGKYYSRPKEAPEPMELD 



FT GPKGTGYIKTELISVSEV"
XX
SQ Sequence 2629 BP; 746 A; 594 C; 653 G; 636 T; 0 other;

tcgctttcct gcgcagagtc tgcggagggg ctcggctgca ccggggggat cgcgcctggc 60 

agaccccaga ccgagcagag gcgacccagc gcgctcggga gaggctgcac cgccgcgccc 120 

ccgcctagcc cttccggatc ctgcgcgcag aaaagtttca tttgctgtat gccatcctcg 180 

agagctgtct aggttaacgt tcgcactctg tgtatataac ctcgacagtc ttggcaccta 240 

acgtgctgtg cgtagctgct cctttggttg aatccccagg cccttgttgg ggcacaaggt 300

ggcaggatgt ctcagtggta cgaacttcag cagcttgact caaaattcct ggagcaggtt 360 

caccagcttt atgatgacag ttttcccatg gaaatcagac agtacctggc acagtggtta 420 

gaaaagcaag actgggagca cgctgccaat gatgtttcat ttgccaccat ccgttttcat 480 

gacctcctgt cacagctgga tgatcaatat agtcgctttt ctttggagaa taacttcttg 540 

ctacagcata acataaggaa aagcaagcgt aatcttcagg ataattttca ggaagaccca 600 

atccagatgt ctatgatcat ttacagctgt ctgaaggaag aaaggaaaat tctggaaaac 660 

gcccagagat ttaatcaggc tcagtcgggg aatattcaga gcacagtgat gttagacaaa 720 

cagaaagagc ttgacagtaa agtcagaaat gtgaaggaca aggttatgtg tatagagcat 780 

gaaatcaaga gcctggaaga tttacaagat gaatatgact tcaaatgcaa aaccttgcag 840 

aacagagaac acgagaccaa tggtgtggca aagagtgatc agaaacaaga acagctgtta 900 

ctcaagaaga tgtatttaat gcttgacaat aagagaaagg aagtagttca caaaataata 960 

gagttgctga atgtcactga acttacccag aatgccctga ttaatgatga actagtggag 1020 

tggaagcgga gacagcagag cgcctgtatt ggggggccgc ccaatgcttg cttggatcag 1080 

ctgcagaact ggttcactat agttgcggag agtctgcagc aagttcggca gcagcttaaa 1140 

aagttggagg aattggaaca gaaatacacc tacgaacatg accctatcac aaaaaacaaa 1200 

caagtgttat gggaccgcac cttcagtctt ttccagcagc tcattcagag ctcgtttgtg 1260 

gtggaaagac agccctgcat gccaacgcac cctcagaggc cgctggtctt gaagacaggg 1320 

gtccagttca ctgtgaagtt gagactgttg gtgaaattgc aagagctgaa ttataatttg 1380 

aaagtcaaag tcttatttga taaagatgtg aatgagagaa atacagtaaa aggatttagg 1440 

aagttcaaca ttttgggcac gcacacaaaa gtgatgaaca tggaggagtc caccaatggc 1500 

agtctggcgg ctgaatttcg gcacctgcaa ttgaaagaac agaaaaatgc tggcaccaga 1560 

acgaatgagg gtcctctcat cgttactgaa gagcttcact cccttagttt tgaaacccaa 1620 

ttgtgccagc ctggtttggt aattgacctc gagacgacct ctctgcccgt tgtggtgatc 1680 

tccaacgtca gccagctccc gagcggttgg gcctccatcc tttggtacaa catgctggtg 1740 

gcggaaccca ggaatctgtc cttcttcctg actccaccat gtgcacgatg ggctcagctt 1800 

tcagaagtgc tgagttggca gttttcttct gtcaccaaaa gaggtctcaa tgtggaccag 1860

ctgaacatgt tgggagagaa gcttcttggt cctaacgcca gccccgatgg tctcattccg 1920 

tggacgaggt tttgtaagga aaatataaat gataaaaatt ttcccttctg gctttggatt 1980 

gaaagcatcc tagaactcat taaaaaacac ctgctccctc tctggaatga tgggtgcatc 2040 

atgggcttca tcagcaagga gcgagagcgt gccctgttga aggaccagca gccggggacc 2100 

ttcctgctgc ggttcagtga gagctcccgg gaaggggcca tcacattcac atgggtggag 2160 

cggtcccaga acggaggcga acctgacttc catgcggttg aaccctacac gaagaaagaa 2220 



ctttctgctg ttactttccc tgacatcatt cgcaattaca aagtcatggc tgctgagaat 2280

attcctgaga atcccctgaa gtatctgtat ccaaatattg acaaagacca tgcctttgga 2340

aagtattact ccaggccaaa ggaagcacca gagccaatgg aacttgatgg ccctaaagga 2400

actggatata tcaagactga gttgatttct gtgtctgaag tgtaagtgaa cacagaagag 2460

tgacatgttt acaaacctca agccagcctt gctcctggct ggggcctgtt gaagatgctt 2520

gtattttact tttccattgt aattgctatc gccatcacag ctgaacttgt tgagatcccc 2580

gtgttactgc ctatcagcat tttactactt taaaaaaaaa aaaaaaaaa 2629



DE Homo sapiens cDNA: FLJ21350 fis, clone COL02751.

tttttttttt tttttttttt aagcaagccc ccaacaccat agaaaattct tgatttgctc 60 

ggaggataat tggatgaagg attattttct tctttgttta tgtgcaagaa atgaaaataa 120 

ggaattgctt tgatcagaca acttcttatc tttgtggtag aaacagaact gcccttcttg 180 

gagtggctct gcctctgaga tcactacagg ggagacagca tgccctgttc agctggctga 240 

atatttggca acaatctcct gaagcagctg gaattgacaa gaagtactgg agattagctc 300

gggccaaacc cttacatctg gcctgactac tgctgcagtc tgcctcaact tacccctaaa 360 

gctggggaga tgccacccac ccacatcttt gctacacatg ccatcatgag ctagagttca 420 

ccctttctcc ttaaagccct atttactttt ctacttcaac tttaaaacaa aattaaaatg 480 

tgaggatatc cctgaatttt aaaaagcatg aagtaaaaat gcaaattagt atagtttgtt 540 

taatacatta catatagacc taaagaaagt tcatcagggt taatcatttg tcacatcatt 600 

ctatacccag ggctatcagc tatcaatttt cctttttttt tttttttttt tttaattcag 660 

gatccagctc tgtcacccag gctggagtgc agtatcaaag tatcatttct cttacttcaa 720 

attattacat tttattctgt acattgattc tgaactccta atataatatt tatgtcctgt 780 

atttgcaggc cattggtttt tttaaagtca taaatcaaaa tgatgccaga aaatcaaaga 840 

tgcccaagat gttgggcttc tcttttgcca gccacattgg tagcactctc ctgccctggc 900 

ctccaagctg gtggggatct gtgatgtttg tgaaaatggg cttatgccag gcgcagtggc 960 

tcacgcctat aatcccagca ctttgggagg ccgaggcggg cagatcggtt caggtcagga 1020 

gatcgagacc agcctggcca acatggtgaa accccatctc tactaaaaat acaaaaaatt 1080 

agccaggtgt ggtggcacat gcctgtaatc ccagctactc aggaggctga ggcaggagaa 1140 

tcacttgaaa cctgggaggc ggaggttgca gtgagcagag atcgtgccat tacactccag 1200 

cctgggctac agagtgagac tctgtgtcaa aaaaaaaaga aagaaaatgg gcttgtgtgg 1260 

tagcaggtaa gaaattgaat ctctgttgta cagcagctag ctgtactgca tgatcacttc 1320 

ccattcccca gctgacagtg gctgtctctg gaactcctac cacagtcttc aattggtagg 1380 

ccagccctgg tgccagtgat tttatctggg catggaaaat gccacttgct tctgtggaag 1440 

agacacttaa aagatctggc agtcggccgg gtgcggtggc tcacgcctat aatcccaaca 1500 

ctctgggagg tcaaggcagg cggatcacga ggtcaggaga tggagaccat cctggctaac 1560 

acggtgaaac cctgtctcta ctaaaaaaaa aaataaaaaa ttagctgggc gaggtggcgg 1620 

gcgcctgtag tcccagctac tctggaggct gaggcaggag aatggcgtga acccaggagg 1680 

cggagcttgc agtgatccga gatcacacca ctgcactgca gtctgggcaa cagagcgaga 1740 

ctccatctca aaaaaaaaaa aaaaa 1765



Homo sapiens IFI16b (IFI16b) mRNA, complete cds.



/translation="MGKKYKNIVLLKGLEVINDYHFRMVKSLLSNDLKLNLKMRE^
IQI ADLMEEKFRGDAGLGKLIK1 FED I PTIiEDIiAETLKKEKLKVKGPAIjSRKRKKEVHA 
TSPAPSTSST\nCTEGAEATPGAQKRKKSTKEKAGPKGSKVSEEQTQPPSPAGAGMSTAM 

GRSPSPKTSLSAPPNSSSTENPKTVAKCQVT^ 

MEKKIMFHATVATQTQFFHVKVliNTSLKEKFNGKKI III SDYLE YDSLLEVNEESTVSE 
AGPNQTFEVPNKI INRAKETLKID ILHKQASGNIVYGVFMIiHKKTVNQKTTI YEIQDDR 
GKI^WGTGQC^IPCEEGDKLQIjFCFRIiRKKNQMSKLISEWH 

KSMKLPQEQRQIiPYPSEASTTFPESHIiRTPQMPPTTPSSSFFTKKSEDTISKMiroFMRM 
QILKEGSHFPGPFMTSIGPAESHPHTPQMPPSTPSSSFLTTLKPRLKTEPEEVSIEDSA 
QSDLKEVMVLNATE S FVYEPKEQKKMFHATVATENEVFRVKVFNIDIiKEKFTPKKI IAI 
ANYVCRNGFLEVYPFTLVADVNADRNMEIPKGLIRSASVTPKINQLCSQTKGSFWGV 
EVHKVSPHHCFIKFIiLQPPIFKVLTCQIiEFGQIiTQHRKSTPSPFPQH" 



Sequence 4151 BP; 1436 A; 806 C; 798 G; 1111 T; 0 other; 
gggaatagca gaataggagc aagccagcac tagtcagcta actaagtgac tcaaccaagg 60
ccttttttcc ttgttatctt tgcagatact tcattttctt agcgtttctg gagattacaa 120
catcctgcgg ttccgtttct gggaacttta ctgatttatc tcccccctca cacaaataag 180
cattgattcc tgcatttctg aagatctcaa gatctggact actgttgaaa aaatttccag 240
tgaggctcac ttatgtctgt aaagatggga aaaaaataca agaacattgt tctactaaaa 300
ggattagagg tcatcaatga ttatcatttt agaatggtta agtccttact gagcaacgat 360
ttaaaactta atttaaaaat gagagaagag tatgacaaaa ttcagattgc tgacttgatg 420
gaagaaaagt tccgaggtga tgctggtttg ggcaaactaa taaaaatttt cgaagatata 480
ccaacgcttg aagacctggc tgaaactctt aaaaaagaaa agttaaaagt aaaaggacca 540
gccctatcaa gaaagaggaa gaaggaagtg catgctactt cacctgcacc ctccacaagc 600
agcactgtca aaactgaagg agcagaggca actcctggag ctcagaaaag aaaaaaatca 660
accaaagaaa aggctggacc caaagggagt aaggtgtccg aggaacagac tcagcctccc 720
tctcctgcag gagccggcat gtccacagcc atgggccgtt ccccatctcc caagacctca 780
ttgtcagctc cacccaacag ttcttcaact gagaacccga aaacagtggc caaatgtcag 840
gtaactccca gaagaaatgt tctccaaaaa cgcccagtga tagtgaaggt actgagtaca 900
acaaagccat ttgaatatga gaccccagaa atggagaaaa aaataatgtt tcatgctaca 960
gtggctacac agacacagtt cttccatgtg aaggttttaa acaccagctt gaaggagaaa 1020
ttcaatggaa agaaaatcat catcatatca gattatttgg aatatgatag tctcctagag 1080
gtcaatgaag aatctactgt atctgaagct ggtcctaacc aaacgtttga ggttccaaat 1140
aaaatcatca acagagcaaa ggaaactctg aagattgata ttcttcacaa acaagcttca 1200
ggaaatattg tatatggggt atttatgcta cataagaaaa cagtaaatca gaagaccaca 1260
atctacgaaa ttcaggatga tagaggaaaa atggatgtag tggggacagg acaatgtcac 1320
aatatcccct gtgaagaagg agataagctc cagcttttct gctttcgact tagaaaaaag 1380
aaccagatgt caaaactgat ttcagaaatg catagtttta tccagataaa gaaaaaaaca 1440
aacccgagaa acaatgaccc caagagcatg aagctacccc aggaacagcg tcagcttcca 1500
tatccttcag aggccagcac aaccttccct gagagccatc ttcggactcc tcagatgcca 1560
ccaacaactc catccagcag tttcttcacc aagaaaagtg aagacacaat ctccaaaatg 1620
aatgacttca tgaggatgca gatactgaag gaagggagtc attttccagg accgttcatg 1680
accagcatag gcccagctga gagccatccc cacactcctc agatgcctcc atcaacacca 1740
agcagcagtt tcttaaccac gttgaaacca agactgaaga ctgaacctga agaagtttcc 1800
atagaagaca gtgcccagag tgacctcaaa gaagtgatgg tgctgaacgc aacagaatca 1860
tttgtatatg agcccaaaga gcagaagaaa atgtttcatg ccacagtggc aactgagaat 1920
gaagtcttcc gagtgaaggt ttttaatatt gacctaaagg agaagttcac cccaaagaag 1980
atcattgcca tagcaaatta tgtttgccgc aatgggttcc tggaggtata tcctttcaca 2040
cttgtggctg atgtgaatgc tgaccgaaac atggagatcc caaaaggatt gattagaagt 2100
gccagcgtaa ctcctaaaat caatcagctt tgctcacaaa ctaaaggaag ttttgtgaat 2160
ggggtgtttg aggtacataa ggtaagccca caccattgtt ttataaaatt tctcctgcaa 2220
cctccaattt ttaaagtctt aacttgtcaa ctggagtttg gtcaacttac tcaacacaga 2280
aaatcaaccc cttcaccctt cccccagcac tagagataat tgaatagagt tcatttcagg 2340
atatggggta cgttatattg taacattcct cttcttaagg tatcatcatg caagttattt 2400
agacagtcac taggaaactt ggcattttat tagttttgat gatctattca gagccaccct 2460


tgtccaggac agtgcagagt ttatatcaac acacatatcc ttaggatttt gtttctttga 2520 

gttcttctcc atctgtatca atgacaactt aatttaattg tgaataaaag agttgctctc 2580 

ccaagcctga atcctgattg tgacaaccag agtaagaaat aaaatagact actctgcttt 2640 

agaatgcagc tatgtctaac agttagctag aattctgatc atttggactc caaagtttct 2700 

tgcctcttct cattcattaa ttcatcagga gactgtagag caactaactt ctgcattaaa 2760 

taataagaga aatacgaagc aaaaagacta aaaaagtcac gtagcttaac tgctcaattt 2820 

ataaatgggg caataaaatg caaaaaaaaa gaaaaaaagc ttggtgaatt cttaggctta 2880 

cagtgtgcct ttcagtctct acacatcatg taaatattat gcttagctga tttaacttct 2940 

tgtttgaagt actgtttcat actccattat acatgtcttc tagggtggct tacttttaat 3000

tgtgctgttt tctctacact cagtttaaat gactgtacat atatatgtgg ttggagagtt 3060 

aatgaataat gagctacaaa ccagaacaat gtgactagat agataggatg atctagaatt 3120 

gagaactggc agattgggaa aagagtggct atatggagaa agaaagaaag tagttccata 3180 

ttgaaataac agtctactta atgaggaccg ttgcaacatt ctttctcaaa cttacaaagt 3240 

gccataaaaa gcctctattc tctgctcttg ggcaggtgtg aaagaaacct accaaattaa 3300

tcagattttt ctgtatccag gctccttaaa aaatcccagc tgtgctgatg tggaaacagg 3360 

aagaattagg aaagtaatca attttttttc ctagaaaaaa tccagcagac aaagaacttc 3420 

aacaaaagag gctcaaggga ggagttgaaa ggcaggattc aaagaccaag tatcttaagc 3480 

tatttggtac ctgttattca ggacctacag ctctgtttac tctatcaaag accaaaagtt 3540 

tccagaaaca ccctgtattt ctcatagatt tgaaaattat tgatccagtt tcagaagata 3600

agtgttaatt ttcttttgca gaaaaatgta aggggtgaat tcacttatta tgaaatacaa 3660 

gataatacag ggaagatgga agtggtggtg catggacgac tgaccacaat caactgtgag 3720 

gaaggagata aactgaaact cacctgcttt gaattggcac cgaaaagtgg gaataccggg 3780 

gagttgagat ctgtaattca tagtcacatc aaggtcatca agaccaggaa aaacaagaaa 3840

gacatactca atcctgattc caagtatgga aacttcacca gactttttct tctaaaatct 3900 

ggatgtcatt gacgataatg tttatggaga taaggtctaa gtgcctaaaa aaatgtacat 3960 

atacctggtt gaaatacaac actatacata cacaccacca tatatactag ctgttaatcc 4020

tatggaatgg ggtattggga gtgctttttt aatttttcat agtttttttt taataaaatg 4080 

gcatattttg catctacaac ttctataatt tgaaaaaata aataaacatt atcttttttg 4140 

tgaaaaaaaa a 4151




Homo sapiens mRNA for STAT induced STAT inhibitor-2, complete cds. 

/translation="MTLRCLEPSGNGGEGTRSQWGTAGSAEEPSPQAARLAKALRELGQ
TGWYWGSMTVNEAKEKLKEAPEGTFLII^SSHSD^ 

RLDSIICVKSKLKQFDSVVHLIDYYVQMCKDKRTGPEAPRNGTVHLYLTKPLYTSAPSL
QHLCRLTINKCTGAIWGLPLPTRLKDYLEEYKFQV"

Sequence 704 BP; 198 A; 172 C; 174 G; 160 T; 0 other; 
gggcggccac ctgtctttgc cgcggtgacc cttctctcat gaccctgcgg tgccttgagc 60
cctccgggaa tggcggggaa gggacgcgga gccagtgggg gaccgcgggg tcggcggagg 120
agccatcccc gcaggcggcg cgtctggcga aggccctgcg ggagctcggt cagacaggat 180
ggtactgggg aagtatgact gttaatgaag ccaaagagaa attaaaagag gcaccagaag 240
gaactttctt gattagagat agctcgcatt cagactacct actaacaata tctgttaaaa 300
catcagctgg accaactaat cttcgaatcg aataccaaga cggaaaattc agattggact 360
ctatcatatg tgtcaaatcc aagcttaaac aatttgacag tgtggttcat ctgatcgact 420
actatgttca gatgtgcaag gataagcgga caggtccaga agccccccgg aacggcactg 480
ttcaccttta tctgaccaaa ccgctctaca cgtcagcacc atctctgcag catctctgta 540
ggctcaccat taacaaatgt accggtgcca tctggggact gcctttacca acaagactaa 600
aagattactt ggaagaatat aaattccagg tataaatgtt tctctttttt taaacatgtc 660
tcacatagag tatctccgaa tgcagctatg taaaagagaa ccaa 704



DE Homo sapiens transcription factor ISGF-3 mRNA, complete cds.
XX
KW transcription factor.






/translation="MSQWYELQQLDSKFLEQVHQLYDDSFPMEIRQYLAQWLEKQDWEH

AAimVSFATIRFHDLLSQIiDDQYSRFSLENNFLLQHNIRKSKRNLQDNPQEDPI 

IYSCLKEERKIIiENAQRFNQAQSGNIQSTVMLDKQKELDSK^ 

EDLQDEYDFKCKTLQNREHETNGVAKSDQKQEQL^ I ELLN 

VTELTQN AL IN DKLVEWKRRQQS AC IGGPPNACLDQLQNWFTI VAESIiQQVRQQLKKIiE 
ELEQKYTYEHDP I TKNKQVZiWDRTFSLFQQL IQS S FVVERQPCMPTHPQRPLVLKTGVQ 
FTVKLRLLVKLQELNYNLKVK^ 

LAAEFRHLQLKEQKNAGTRTNEGPIj I VTEELHS LS FETQLCQPGLVIDLETTS IiPVWI 

SNVSQLPSGWASILWYNMLVAEPRNLSFFLTPPC^ 

QLNMIjGEKIiI<GPNASPDGIiIPWTRFCKENIND^ 

CIMGFISKERERALLKDQQPGTFLIjRFSES SREGAI TFTWVERSQNGGEPDFHAVEPYT 
K3CELSAVTFPDIIRNYKVMAAENIPENPLKYL^^ 

GPKGTGYIKTELISVSEVHPSRLQTTDNLLPMSPEEFDEVSRIVGSVEFDSMMNTV"



Sequence 4003 BP; 1173 A; 812 C; 883 G; 1135 T; 0 other;
attaaacctc tcgccgagcc cctccgcaga ctctgcgccg gaaagtttca tttgctgtat 60
gccatcctcg agagctgtct aggttaacgt tcgcactctg tgtatataac ctcgacagtc 120
ttggcaccta acgtgctgtg cgtagctgct cctttggttg aatccccagg cccttgttgg 180
ggcacaaggt ggcaggatgt ctcagtggta cgaacttcag cagcttgact caaaattcct 240
ggagcaggtt caccagcttt atgatgacag ttttcccatg gaaatcagac agtacctggc 300
acagtggtta gaaaagcaag actgggagca cgctgccaat gatgtttcat ttgccaccat 360
ccgttttcat gacctcctgt cacagctgga tgatcaatat agtcgctttt ctttggagaa 420
taacttcttg ctacagcata acataaggaa aagcaagcgt aatcttcagg ataattttca 480
ggaagaccca atccagatgt ctatgatcat ttacagctgt ctgaaggaag aaaggaaaat 540
tctggaaaac gcccagagat ttaatcaggc tcagtcgggg aatattcaga gcacagtgat 600
gttagacaaa cagaaagagc ttgacagtaa agtcagaaat gtgaaggaca aggttatgtg 660
tatagagcat gaaatcaaga gcctggaaga tttacaagat gaatatgact tcaaatgcaa 720
aaccttgcag aacagagaac acgagaccaa tggtgtggca aagagtgatc agaaacaaga 780
acagctgtta ctcaagaaga tgtatttaat gcttgacaat aagagaaagg aagtagttca 840
caaaataata gagttgctga atgtcactga acttacccag aatgccctga ttaatgatga 900
actagtggag tggaagcgga gacagcagag cgcctgtatt ggggggccgc ccaatgcttg 960
cttggatcag ctgcagaact ggttcactat agttgcggag agtctgcagc aagttcggca 1020
gcagcttaaa aagttggagg aattggaaca gaaatacacc tacgaacatg accctatcac 1080
aaaaaacaaa caagtgttat gggaccgcac cttcagtctt ttccagcagc tcattcagag 1140
ctcgtttgtg gtggaaagac agccctgcat gccaacgcac cctcagaggc cgctggtctt 1200
gaagacaggg gtccagttca ctgtgaagtt gagactgttg gtgaaattgc aagagctgaa 1260
ttataatttg aaagtcaaag tcttatttga taaagatgtg aatgagagaa atacagtaaa 1320
aggatttagg aagttcaaca ttttgggcac gcacacaaaa gtgatgaaca tggaggagtc 1380
caccaatggc agtctggcgg ctgaatttcg gcacctgcaa ttgaaagaac agaaaaatgc 1440
tggcaccaga acgaatgagg gtcctctcat cgttactgaa gagcttcact cccttagttt 1500
tgaaacccaa ttgtgccagc ctggtttggt aattgacctc gagacgacct ctctgcccgt 1560
tgtggtgatc tccaacgtca gccagctccc gagcggttgg gcctccatcc tttggtacaa 1620
catgctggtg gcggaaccca ggaatctgtc cttcttcctg actccaccat gtgcacgatg 1680
ggctcagctt tcagaagtgc tgagttggca gttttcttct gtcaccaaaa gaggtctcaa 1740
tgtggaccag ctgaacatgt tgggagagaa gcttcttggt cctaacgcca gccccgatgg 1800
tctcattccg tggacgaggt tttgtaagga aaatataaat gataaaaatt ttcccttctg 1860
gctttggatt gaaagcatcc tagaactcat taaaaaacac ctgctccctc tctggaatga 1920
tgggtgcatc atgggcttca tcagcaagga gcgagagcgt gccctgttga aggaccagca 1980
gccggggacc ttcctgctgc ggttcagtga gagctcccgg gaaggggcca tcacattcac 2040
atgggtggag cggtcccaga acggaggcga acctgacttc catgcggttg aaccctacac 2100
gaagaaagaa ctttctgctg ttactttccc tgacatcatt cgcaattaca aagtcatggc 2160
tgctgagaat attcctgaga atcccctgaa gtatctgtat ccaaatattg acaaagacca 2220
tgcctttgga aagtattact ccaggccaaa ggaagcacca gagccaatgg aacttgatgg 2280



ccctaaagga actggatata tcaagactga gttgatttct gtgtctgaag ttcacccttc 2340 

tagacttcag accacagaca acctgctccc catgtctcct gaggagtttg acgaggtgtc 2400 

tcggatagtg ggctctgtag aattcgacag tatgatgaac acagtataga gcatgaattt 2460 

ttttcatctt ctctggcgac agttttcctt ctcatctgtg attccctcct gctactctgt 2520 

tccttcacat cctgtgtttc tagggaaatg aaagaaaggc cagcaaattc gctgcaacct 2580 

gttgatagca agtgaatttt tctctaactc agaaacatca gttactctga agggcatcat 2640 

gcatcttact gaaggtaaaa ttgaaaggca ttctctgaag agtgggtttc acaagtgaaa 2700 

aacatccaga tacacccaaa gtatcaggac gagaatgagg gtcctttggg aaaggagaag 2760 

ttaagcaaca tctagcaaat gttatgcata aagtcagtgc ccaactgtta taggttgttg 2820

gataaatcag tggttattta gggaactgct tgacgtagga acggtaaatt tctgtgggag 2880

aattcttaca tgttttcttt gctttaagtg taactggcag ttttccattg gtttacctgt 2940

gaaatagttc aaagccaagt ttatatacaa ttatatcagt cctctttcaa aggtagccat 3000 

catggatctg gtagggggaa aatgtgtatt ttattacatc tttcacattg gctatttaaa 3060

gacaaagaca aattctgttt cttgagaaga gaatattagc tttactgttt gttatggctt 3120

aatgacacta gctaatatca atagaaggat gtacatttcc aaattcacaa gttgtgtttg 3180

atatccaaag ctgaatacat tctgctttca tcttggtcac atacaattat ttttacagtt 3240

ctcccaaggg agttaggcta ttcacaacca ctcattcaaa agttgaaatt aaccatagat 3300 

gtagataaac tcagaaattt aattcatgtt tcttaaatgg gctactttgt cctttttgtt 3360

attagggtgg tatttagtct attagccaca aaattgggaa aggagtagaa aaagcagtaa 3420

ctgacaactt gaataataca ccagagataa tatgagaatc agatcatttc aaaactcatt 3480 

tcctatgtaa ctgcattgag aactgcatat gtttcgctga tatatgtgtt tttcacattt 3540 

gcgaatggtt ccattctctc tcctgtactt tttccagaca cttttttgag tggatgatgt 3600 

ttcgtgaagt atactgtatt tttacctttt tccttcctta tcactgacac aaaaagtaga 3660 

ttaagagatg ggtttgacaa ggttcttccc ttttacatac tgctgtctat gtggctgtat 3720

cttgtttttc cactactgct accacaacta tattatcatg caaatgctgt attcttcttt 3780 

ggtggagata aagatttctt gagttttgtt ttaaaattaa agctaaagta tctgtattgc 3840

attaaatata atatcgacac agtgctttcc gtggcactgc atacaatctg aggcctcctc 3900

tctcagtttt tatatagatg gcgagaacct aagtttcagt tgattttaca attgaaatga 3960

ctaaaaaaca aagaagacaa cattaaaaac aatattgttt cta 4003



XX

DE Homo sapiens pancreas sodium bicarbonate cotransporter mRNA, complete cds.
FT /translation="MEDEAVLDRGASFLKHVCDEEEVEGHHTIYIGVHVPKSYRRRRRH




KRKTGHKEKKEKERI SENYSDKSD IENADE SSSS ILKPL I S P AAER IRF I LGEEDD S PA 
PPQIiFTELDELLAVDGQEMEWKETARWIKFEEKVEQGGERWSKPHVATLSLHSLF 
CMEKGSIfl^REASSLPQLVEMIVDHQIETG^ 
SIJtfDIGK^SSASRMFTNPDNGSPAlVrra 

AEASNVLVGEVDFLDTPFIAFVRLQQAVMLGALTEVPVPTRF 

GRAIATLMSDEVFHDIAYKAKDRHDLIAGIDEFLDEVIVLPPGEWDPAIRIEPPKSLPS 
SDKRKNMYS GGENVQMNGDTPHDGGHGGGGHGDCEELQRTGRFCGGLI KD I KRKAPFFA 
SDFYDALNIQALSAILFIYLATVTNAITFGGLLGDATDNMQGVLESFLGTAVSGAIFCL 
FAGQPLTILSSTGPVLVFERIiLFNFSKDNNFDYIiEFRLW 

VQYFTRFTEEGFSSLISFIFIYDAFKKMIKIJUDYYPINSNFKVGYNTLFSCTCVPPDPA 
T NISISITOTTIiAPEYLPTMSSTDMYHNTTFDWAFIiSKKECSKYGGNL 

r MS F ILFLGTYTS smalkkfkts p yfpttarkli sdfai ilsilifcvidalvgvdtpkl 

IVPSEFKPTSPimGWFVPPFGEOTVWVCLAAAIPALLVTILIFI^QQITAVIV^ 
LKKGAGYHLDLFWVAILMVICSLMALPWWAATVTSIAHI^ 

GVREQRVTGTLVFILTGIiS VFMAP ILKF I PMPVLYGVFLYMGVASLNGVQFMDRIiKLLIi 
MPLKHQPDFIYLRHVPLRRVHLFTFLQVLCIi^^ 
r MDYLFSQHDLSFIiDDVIPEKDKKKKEDEKKKKKKKGSIiDSDOT 
FT MDIMEQQPFLSDSKPSDRERSPTFLERHTSC"

XX

SQ Sequence 5322 BP; 1507 A; 1113 C; 1142 G; 1560 T; 0 other;



cggcagtggc agtggccgct 60
agccagaccc caggaggatg 120
agcatgtgtg tgatgaagaa 180
tgccgaagag ttacaggaga 240
aggaaaagga gagaatctct 300
aatccagcag cagcatccta 360
tcttgggaga ggaggatgac 420
tgctggccgt ggatgggcag 480
aagaaaaagt ggaacagggt 540
ttcatagttt atttgagctg 600
aggcttcttc tctcccacag 660
tattgaaacc tgaacttaag 720
aaaccaagaa atccaacctt 780
gtaggatgtt taccaaccct 840
cctccagtct gaatgacatt 900
tgaaaaaatt gccacgtgat 960
tggatactcc tttcattgcc 1020
ctgaagttcc tgtgcccaca 1080
agtcctacca cgagattggc 1140
acattgctta taaagcaaaa 1200
atgaagtcat cgtccttcca 1260
agagtcttcc atcctctgac 1320
tgaatgggga tacgccccat 1380
aattgcagcg aactggacgg 1440
cattttttgc cagtgatttt 1500
tcatttatct ggcaactgta 1560
ctgacaacat gcagggcgtg 1620
tttgcctttt tgctggtcaa 1680
ttgagaggct tctatttaat 1740
tttggattgg cctgtggtcc 1800
tcttggttca atacttcaca 1860
tctttatcta tgatgctttc 1920

aagaagatga tcaagcttgc agattactac cccatcaact ccaacttcaa agtgggctac 1980
aacactctct tttcctgtac ctgtgtgcca cctgacccag ctaatatctc aatatctaat 2040
gacaccacac tggccccaga gtatttgcca actatgtctt ctactgacat gtaccataat 2100
actacctttg actgggcatt tttgtcgaag aaggagtgtt caaaatacgg aggaaacctc 2160
gtcgggaaca actgtaattt tgttcctgat atcacactca tgtcttttat cctcttcttg 2220
ggaacctaca cctcttccat ggctctgaaa aaattcaaaa ctagtcctta ttttccaacc 2280
acagcaagaa aactgatcag tgattttgcc attatcttgt ccattctcat cttttgtgta 2340
atagatgccc tagtaggcgt ggacacccca aaactaattg tgccaagtga gttcaagcca 2400
acaagtccaa accgaggttg gttcgttcca ccgtttggag aaaacccctg gtgggtgtgc 2460
cttgctgctg ctatcccggc tttgttggtc actatactga ttttcatgga ccaacaaatt 2520
acagctgtga ttgtaaacag gaaagaacat aaactcaaga aaggagcagg gtatcacttg 2580
gatctctttt gggtggccat cctcatggtt atatgctccc tcatggctct tccgtggtat 2640
gtagctgcta cggtcatctc cattgctcac atcgacagtt tgaagatgga gacagagact 2700
tctgcacctg gagaacaacc aaagtttcta ggagtgaggg agcaaagagt cactggaacc 2760
cttgtgttta ttctgactgg tctgtcagtc tttatggctc ccatcttgaa gtttataccc 2820
atgcctgtac tctatggtgt gttcctgtat atgggagtag catcccttaa tggtgtgcag 2880
ttcatggatc gtctgaagct gcttctgatg cctctgaagc atcagcctga cttcatctac 2940
ctgcgtcatg ttcctctgcg cagagtccac ctgttcactt tcctgcaggt gttgtgtctg 3000
gccctgcttt ggatcctcaa gtcaacggtg gctgctatca tttttccagt aatgatcttg 3060
gcacttgtag ctgtcagaaa aggcatggac tacctcttct cccagcatga cctcagcttc 3120
ctggatgatg tcattccaga aaaggacaag aaaaagaagg aggatgagaa gaaaaagaaa 3180
aagaagaagg gaagtctgga cagtgacaat gatgattctg actgcccata ctcagaaaaa 3240
gttccaagta ttaaaattcc aatggacatc atggaacagc aacctttcct aagcgatagc 3300
aaaccttctg acagagaaag atcaccaaca ttccttgaac gccacacatc atgctgataa 3360
aattcctttc cttcagtcac tcggtatgcc aagtcctcct agaactccag taaaagttgt 3420
gcctcaaatt agaatagaac ttgaacctga agacaatgat tatttctgga ggagcaaggg 3480
aacagaaact acattgtaac ctgtttgtct ttcttaaaac tgacatttgt tttaatgtca 3540
tttgtttttg tttggctgtt tgtttatttt ttaactttta tttcgtctca gtttttggtc 3600
acaggccaaa taatacagcg ctctctctgc ttctctcttg catagataca atcaagacaa 3660
tagtgcaccg ttccttaaaa acagcatctg aggaatcccc cttttgttct taaactttca 3720
gatgtgtcct ttgataacca aattctgtca ctcaagacac agacacccac agaccctgtc 3780
ctttgcctct attaagcaga ggatggaagt attaaggatt ttgtaacacc ttttatgaaa 3840
atgttgaagg aacttaaaac tttagctttg gagctgtgct tactggcttg tctttgtctg 3900
gtagaacaaa ccttgacctc cagacagagt cccttctcac ttatagagct ctccaggact 3960
ggaaaaagtg ctgctatttt aacttgctct tgcttgtaaa tcctaatctt agagttatca 4020
aaagaagaaa aaactgaagg tactttactc cctatagaga aaccattgcc atcattgtag 4080
caagtgctgg aatgtccctt ttttcctatg caactttttt taacccttta atgaacttat 4140
ctgttgagta cattgaagaa tatttttctt cctagatttt gttgtttaaa ttatggggcc 4200
taacctgcca cttatttttt gtcaattttt aaaacttttt tttaattact gtaaagaaaa 4260
tgaatttttt cctgcagcag gaaacatagt tttgagtagt tctacctctt atttgtagct 4320
gccaggcttt ctgtaaaaat tgtattgtat ataatgtgat ttttacacat acatacacac 4380
acaaatacac aatctctagg gtaagccaga aggcaagatc agattaaaaa caccatgttt 4440
ctaagcatcc atttttccct ttctttaaaa gaaacttaac tgttctatga aggagattga 4500
gggagaagag acaaactcct atgtcatgag aataaccgat gttctgataa tagtagcatc 4560
taggtacaga tgctggttgt attaccacgt caatgtccta tgcagtattg ttagacattt 4620
tctcattttg aaatatttgt gtgtttgtgt atgtgctctg tgccatggct ggtgtatata 4680
tgtgcaatgt tagaaggcaa aagagtgatg gtaggcagag ggcaaagtca ttgaatctct 4740
tatgccagtt ttcataaaac ccaaaccaca tatgaaaaaa tccattaagg gtccaagaag 4800
tctgtccata tgaaaatgag ggtaaatata gtttatttcc caggtatcag tcattataat 4860
tgatataata gctctaacat gcaatataaa attcatagga gtattaatag cccatttaca 4920
catctataaa atgtaatggg attgcagagc tgcagagtac agtgtaacag tactctcatg 4980
caattttttt caggatgcaa aggcaattat tctttgtaag cgggacattt agaatatatt 5040
tgtgtacata ttatatgtat gtatatttca aagtaccaca ctgaaaatta gacatttatt 5100
aaccaaattt aacgtggtat ttaaaggtaa tatttttaat atgatacatt acatattgtg 5160
aatgtatact aaaaaaacat tttaaatgtt aaaattataa tttcagattc atataaccac 5220
aactgtgata tatcctaact ataaccagtt gttgaggggt atactagaag cagaatgaaa 5280
ccacattttt tggtttgata atatgcactt attgactccc ac 5322



Homo sapiens interferon stimulated T-cell alpha chemoattractant precursor,
mRNA, complete cds.

/translation="MSVKGMAIALAVILCATVVQGFPMFKRGRCLCIGPGVKAVKVADI
EKASIMYPSNNCDKIEVIITLKENKGQRCLNPKSKQARLIIKKVERKNF"

ctccttccaa gaagagcagc aaagctgaag tagcagcaac agcaccagca gcaacagcaa 60 

aaaacaaaca tgagtgtgaa gggcatggct atagccttgg ctgtgatatt gtgtgctaca 120 

gttgttcaag gcttccccat gttcaaaaga ggacgctgtc tttgcatagg ccctggggta 180 

aaagcagtga aagtggcaga tattgagaaa gcctccataa tgtacccaag taacaactgt 240

gacaaaatag aagtgattat taccctgaaa gaaaataaag gacaacgatg cctaaatccc 300 

aaatcgaagc aagcaaggct tataatcaaa aaagttgaaa gaaagaattt ttaaaaatat 360 

caaaacatat gaagtcctgg aaaagggcat ctgaaaaacc tagaacaagt ttaactgtga 420 

ctactgaaat gacaagaatt ctacagtagg aaactgagac ttttctatgg ttttgtgact 480

ttcaactttt gtacagttat gtgaaggatg aaaggtgggt gaaaggacca aaaacagaaa 540 

tacagtcttc ctgaatgaat gacaatcaga attccactgc ccaaaggagt ccagcaatta 600 

aatggatttc taggaaaagc taccttaaga aaggctggtt accatcggag tttacaaagt 660 

gctttcacgt tcttacttgt tgtattatac attcatgcat ttctaggcta gagaaccttc 720 

tagatttgat gcttacaact attctgttgt gactatgaga acatttctgt ctctagaagt 780 

tatctgtctg tattgatctt tatgctatat tactatctgt ggttacagtg gagacattga 840 

cattattact ggagtcaagc ccttataagt caaaagcatc tatgtgtcgt aaagcattcc 900 

tcaaacattt tttcatgcaa atacacaytt ctttccccaa atatcatgta gcacatcaat 960 

atgtagggaa acattcttat gcatcatttg gtttgtttta taaccaattc attaaatgta 1020 

attcataaaa tgtactatga aaaaaattat acgctatggg atactggcaa cagtgcacat 1080 

atttcataac caaattagca gcaccggtct taatttgatg tttttcaact tttattcatt 1140 

gagatgtttt gaagcaatta ggatatgtgt gtttactgta ctttttgttt tgatccgttt 1200

gtataaatga tagcaatatc ttggacacat ttgaaataca aaatgttttt gtctaccaaa 1260 

gaaaaatgtt gaaaaataag caaatgtata cctagcaatc acttttactt tttgtaattc 1320 

tgtctcttag aaaaatacat aatctaatca aaaaaaaaaa aaaaaaaaaa a 1371 



Homo sapiens mRNA; cDNA DKFZp586J0323 (from clone DKFZp586J0323) 

gtttggaagt gatagcaaat aaaagccacc ttgaactggg tctgatgcag cattcttacc 60 

aaacttcaaa cctggactag ttcatcctta tgcattgagg cttttttatt cgtgttcgtt 120 

tgtttttacc agttaactat caacataaat ttcatttata atattgtatg ttcagtgttc 180 

caaaaaactg gtcctaccaa catagtttgg aatgactcca ttataagatg gtgactgcct 240 

gtatcaaatc tttactgcct ttttcaaatt cttaccattt ttataaaaag gagtcacact 300 

actcaatcta tacatcagtg ttaaatatga tttttactaa tttttttttt ttttaccaac 360 

actatcttaa aaaatctgac agcatagagc agtgattaaa ggcatttgct tcagggtcaa 420 

atatagttac actgctgttt tttggacaat ttgttatttt gaaccattgt ttcttacctt 480 

tataaaatga gcataagata atgttctttt aaggtgagta tgagatacaa atgagaaaag 540 

caataataat aaagattcaa caatggaaac tgctatttac attatgattg ttataattag 600 

aaggacaaac tgtaatttaa cgttcctata gtaataaaat ggcatctaca gagcaaatct 660 

aaacagactt aatcttcata taacaattca tcccagataa tttgaattga ccataataac 720 

atgtttgaaa ggaggctgaa ataaaacagg gtttgctctt ttcaactctt tagccaagac 780

tttttttaaa aaaaactggt atataaatgc tttgtatttc ctttcaagtt tgaagggaaa 840 

ataaatataa tacttaatag attttcaagt atctctttag acattctctt gtttaggctt 900 

gtactaatcc attcattcag ggtttgactg ttgggtgaac ttctttgccc tattcccagc 960 

tgtgagaggc aattctccag gtttctaagc tttagaccta tgcccttctc tgatgaggtt 1020 

aattagggtt tgggctgaaa cccagattcc tatatatgtg gatagagtga tgtagaagta 1080 

ctttatgata ataaatataa atgaaattta gattttaatt tagaaataga aaacatttag 1140 

gcaactcact gaatcaaaaa taaatataga caaatttaat taatatattt atttattata 1200 

ttttcctggg ttgggctttg gcctctattt catatatttt gtttattttt caggatgttc 1260 

tgggataatg tagtccaggt tctagttctg cctttgtcac ttactagcca tctagtcata 1320

taatgctgaa aaatatatta tgtagctcta agataaattc aacgtacctg catagcacta 1380 

ttcaacatac ttgcacagta ttattgtcaa tatgtagtct aacagataag aaaccactgt 1440

gaaaatagta aaccattata aaaaatgatg ttttataatg atacatcttt gcacttctaa 1500 

tattattgct ttaaataaat gtggtatata ttcagcatac agccaaattt acaaaatttt 1560 

atctttttaa ttttatgtca tatataaaga accttaatgc caaaataatc tttatagtga 1620

tgactcgaca tcccaaaaat gtgggaccca acagggacat taataaagag ataaagcagg 1680

ctctgtgccg acttgttaga atttggtacg tggcaaatgt gcagtcacac ggccttaaac 1740 

ctcccctggg aaactgctca aacatttgca ttcccacaat gtatttattc aggaaatact 1800 

cttttcctga gaaaaatctg gctctcaaac ctctccctga gattatgcaa gttctccatt 1860 

ggggaaatac agttggaaat ctagtaaaga atgtttatta acaaaaatcc atttgcagat 1920 

cctagttatt ctcaatattt tgtggattta caattgagaa tactacttgc aaaaagtaaa 1980 

cttcaaaaaa aggaaataga aagctatcag aattcctgta gagtttgttg ttccccttct 2040

cctccaacat agttttattc taatgttttc tttattcatt ttctttggta ttagtgtgct 2100 

gtaaactcac ccaaaatgaa aaacaatcaa aataaaataa aacaaaatga aagcacatca 2160 

gttaaagata aaacaaaaca aaaaacaggc tgccaggttt tagtggacag aagggtaggc 2220 

cccctggaga tcagccaaat tgagtggatg gaagtaatca acaacaagag gacaacaaca 2280 

ttagccactg ctgagcagaa aattagaaag attcttcaat ttattttttt ttcttttctg 2340 

gttgtcaggt tttgtctctc ctgtcttcca ctggtacacc ataaatcagc agctaaaatg 2400 

aaatttcctt cttttagaca tggcaaagta ctcattgagc ttagagatta atagtaaaac 2460 

tttaaaaaaa aaaaaaaaaa 2480



Homo sapiens cDNA FLJ20637 fis, clone KAT03212. 



/translation="MQMSEKKAYMLMHETILQKKDEFPPSPRFILRVRRSRLVKDALRQ
LSQAEATDFCKVLVVEFINEICPESGGVSSEFFHCMFEEMTKPEYGMFMYPEMCSCMWF
PAKPKPEKKRYFLFGMLCGLSLFNLNVANLPFPLALYKKLLDQKPSLEDLKELSPRLGK
SLQEVLDDAADDIGDALCIRFSIHWDQNDVDLIPNGISIPVDQTNKRDYVSKYIDYIFN
VSVKAVYEEFQRGFYRVCEKEILRHFYPEELMTAIIGNTDYDWKQFEQNSKYEQGYQKS
HPTIQLFWKAFHKLTLDEKKKFLFFLTGRDRLHARGIQKMEIVFRCPETFSERDHPTSI
TCHNILSLPKYSTMERMEEALQVAINNNRGFVSPMLTQS"



Sequence 2010 BP; 640 A; 415 C; 397 G; 558 T; 0 other; 

gtcgactacc agaaaatact ttcaacataa atgaactctc caacttatta aacttttata 60 

tagatagagg aagacagctc tttcgggata accacctgat acctgcagaa acccccagtc 120 

ctgttatttt cagtgatttt ccatttatct ttaattcgct atccaaaatt aaattattgc 180 

aagctgattc acatataaag atgcagatgt cagaaaagaa agcatacatg cttatgcatg 240 

aaacaattct gcaaaaaaag gatgaatttc ctccatcacc cagatttata cttagagtca 300 

gacgaagtcg cctggttaaa gatgctctgc gtcaattaag tcaagctgaa gctactgact 360 

tctgcaaagt attagtggtt gaatttatta atgaaatttg tcctgagtct ggaggggtta 420 

gttcagagtt cttccactgt atgtttgaag agatgaccaa gccagaatat ggaatgttca 480 

tgtatcctga aatgtgttcc tgcatgtggt ttcctgccaa gcctaaacct gagaagaaaa 540 

gatatttcct ctttggaatg ctgtgtggac tctccttatt caatttaaat gttgctaacc 600 

ttcctttccc actggctctg tataaaaaac ttctggacca aaagccatca ttggaagatt 660 

taaaagaact cagtcctcgg ttggggaaga gtttgcaaga agttctagat gatgctgctg 720 

atgacattgg agatgcgctc tgcatacgct tttctataca ctgggaccaa aatgatgttg 780 

acttaattcc aaatgggatc tccatacctg tggaccaaac caacaagaga gactatgttt 840 

ctaagtatat tgattacatt ttcaacgtct ctgtaaaagc agtttatgag gaatttcaga 900 

gaggatttta tagagtctgt gagaaggaga tacttagaca tttctaccct gaagaactaa 960 

tgacagcaat cattggaaat actgattatg actggaaaca gtttgaacag aattcaaagt 1020 

atgagcaagg ataccaaaaa tcacatccta ctatacagtt gttttggaag gctttccaca 1080 

agctaacctt ggatgaaaag aaaaaattcc tctttttcct tacaggacgt gataggctgc 1140 

atgcaagagg catacagaaa atggaaatag tatttcgctg tcctgaaact ttcagtgaaa 1200 

gagatcaccc aacatcaata acttgtcata atattctctc cctccctaag tattctacaa 1260 

tggaaagaat ggaggaagca ctccaagtag ccatcaacaa caacagagga tttgtctcac 1320 

ccatgctcac acagtcataa tcacctctga gagactcagg gtgggctttc tcacacttgg 1380 

atccttctgt tcttccttac acctaaataa tacaagagat taatgaatag tggttagaag 1440 

tagttgaggg agagattggg ggaatgggga gatgatgatg atggtcaaag ggtgcaaaat 1500 

ctcacacaag actgaggcag gagaataggg tacagagata gggatctaag gatgacttgg 1560 

acacactccc tggcactgaa gagtctgaac actggcctgt gattggtcca ttccaggacc 1620 

ttcatttgca taaggtatca aaccacatca gcctctgatt ggccatgggc cagacctgca 1680 

ctctggccaa tgattggttc attccaggac attcatttgc ataaggagtc aaaccacacc 1740 

agtcttggat tggctgtgag ccaattcacc tcagtctcta attggctgtg agtcagtctt 1800 

tcatttacat agggtgtaac catcaagaaa cctctacagg gtacttaagc cccagaagat 1860 

tttgctacca gggctcttga gccacttgct ctagcccact cccaccctgt ggaatgtact 1920 

ttcacttttg ctgcttcact gccttgtgct ccaataaatc cactccttca ccacccaaaa 1980 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2010 



Homo sapiens sodium bicarbonate cotransporter (HNBC1) mRNA, complete cds. 

/translation="MSTENVEGKPSNLGERGRARSSTFLRVVQPMFNHSIFTSAVSPAA
ERIRFILGEEDDSPAPPQLFTELDELLAVDGQEMEWKETARWIKFEEKVEQGGERWSKP
HVATLSLHSLFELRTCMEKGSIMLDREASSLPQLVEMIVDHQIETGLLKPELKDKVTYT
LLRKHRHQTKKSNLRSLADIGKTVSSASRMFTNPDNGSPAMTHRNLTSSSLNDISDKPE
KDQLKNKFMKKLPRDAEASNVLVGEVDFLDTPFIAFVRLQQAVMLGALTEVPVPTRFLF
ILLGPKGKAKSYHEIGRAIATLMSDEVFHDIAYKAKDRHDLIAGIDEFLDEVIVLPPGE
WDPAIRIEPPKSLPSSDKRKNMYSGGENVQMNGDTPHDGGHGGGGHGDCEELQRTGRFC
GGLIKDIKRKAPFFASDFYDALNIQALSAILFIYLATVTNAITFGGLLGDATDNMQGVL
ESFLGTAVSGAIFCLFAGQPLTILSSTGPVLVFERLLFNFSKDNNFDYLEFRLWIGLWS
AFLCLILVATDASFLVQYFTRFTEEGFSSLISFIFIYDAFKKMIKLADYYPINSNFKVG
YNTLFSCTCVPPDPANISISNDTTLAPEYLPTMSSTDMYHNTTFDWAFLSKKECSKYGG
NLVGNNCNFVPDITLMSFILFLGTYTSSMALKKFKTSPYFPTTARKLISDFAIILSILI
FCVIDALVGVDTPKLIVPSEFKPTSPNRGWFVPPFGENPWWVCLAAAIPALLVTILIFM
DQQITAVIVNRKEHKLKKGAGYHLDLFWVAILMVICSLMALPWYVAATVISIAHIDSLK
METETSAPGEQPKFLGVREQRVTGTLVFILTGLSVFMAPILKFIPMPVLYGVFLYMGVA
SLNGVQFMDRLKLLLMPLKHQPDFIYLRHVPLRRVHLFTFLQVLCLALLWILKSTVAAI
IFPVMILALVAVRKGMDYLFSQHDLSFLDDVIPEKDKKKKEDEKKKKKKKGSLDSDNDD
SDCPYSEKVPSIKIPMDIMEQQPFLSDSKPSDRERSPTFLERHTSC"



Sequence 7586 BP; 2211 A; 1473 C; 1501 G; 2401 T; 0 other;

gttctttgtg acacatcaca cagaattgga gtgctgtcct tctggagagt ggtggagaac 60

caagatacag ttcagaacca aaggaataga gaagggcttt gatttctttt tggctttaga 120 

ttggggattt gggaggctta gcaggaaaga tgtccactga aaatgtggaa gggaagccca 180 

gtaaccttgg ggagagagga agagcccgga gctccacttt cctcagggtt gtccagccaa 240 

tgtttaacca cagtattttc acttctgcag tctctcctgc tgcagaacgc atccgattca 300 

tcttgggaga ggaggatgac agcccagctc cccctcagct cttcacggaa ctggatgagc 360 

tgctggccgt ggatgggcag gagatggagt ggaaggaaac agccaggtgg atcaagtttg 420 

aagaaaaagt ggaacagggt ggggaaagat ggagcaagcc ccatgtggcc acattgtccc 480 

ttcatagttt atttgagctg aggacatgta tggagaaagg atccatcatg cttgatcggg 540 

aggcttcttc tctcccacag ttggtggaga tgattgttga ccatcagatt gagacaggcc 600 

tattgaaacc tgaacttaag gataaggtga cctatacttt gctccggaag caccggcatc 660 
aaaccaagaa atccaacctt cggtccctgg ctgacattgg gaagacagtc tccagtgcaa 720 
gtaggatgtt taccaaccct gataatggta gcccagccat gacccatagg aatctgactt 780 
cctccagtct gaatgacatt tctgataaac cggagaagga ccagctgaag aataagttca 840

tgaaaaaatt gccacgtgat gcagaagctt ccaacgtgct tgttggggag gttgactttt 900 
tggatactcc tttcattgcc tttgttaggc tacagcaggc tgtcatgctg ggtgccctga 960 

ctgaagttcc tgtgcccaca aggttcttgt tcattctctt aggtcctaag gggaaagcca 1020 

agtcctacca cgagattggc agagccattg ccaccctgat gtctgatgag gtgttccatg 1080 

acattgctta taaagcaaaa gacaggcacg acctgattgc tggtattgat gagttcctag 1140 

atgaagtcat cgtccttcca cctggggaat gggatccagc aattaggata gagcctccta 1200 

agagtcttcc atcctctgac aaaagaaaga atatgtactc aggtggagag aatgttcaga 1260 

tgaatgggga tacgccccat gatggaggtc acggaggagg aggacatggg gattgtgaag 1320 

aattgcagcg aactggacgg ttctgtggtg gactaattaa agacataaag aggaaagcgc 1380 

cattttttgc cagtgatttt tatgatgctt taaatattca agctctttcg gcaattctct 1440 

tcatttatct ggcaactgta actaatgcta tcacttttgg aggactgctt ggggatgcca 1500 

ctgacaacat gcagggcgtg ttggagagtt tcctgggcac tgctgtctct ggagccatct 1560 

tttgcctttt tgctggtcaa ccactcacta ttctgagcag caccggacct gtcctagttt 1620 

ttgagaggct tctatttaat ttcagcaagg acaataattt tgactatttg gagtttcgcc 1680 

tttggattgg cctgtggtcc gccttcctat gtctcatttt ggtagccact gatgccagct 1740 

tcttggttca atacttcaca cgtttcacgg aggagggctt ttcctctctg attagcttca 1800 

tctttatcta tgatgctttc aagaagatga tcaagcttgc agattactac cccatcaact 1860

ccaacttcaa agtgggctac aacactctct tttcctgtac ctgtgtgcca cctgacccag 1920 

ctaatatctc aatatctaat gacaccacac tggccccaga gtatttgcca actatgtctt 1980

ctactgacat gtaccataat actacctttg actgggcatt tttgtcgaag aaggagtgtt 2040 

caaaatacgg aggaaacctt gtcgggaaca actgtaattt tgttcctgat atcacactca 2100 



tgtcttttat cctcttcttg ggaacctaca cctcttccat ggctctgaaa aaattcaaaa 2160 

ctagtcctta ttttccaacc acagcaagaa aactgatcag tgattttgcc attatcttgt 2220 

ccattctcat cttttgtgta atagatgccc tagtaggcgt ggacacccca aaactaattg 2280

tgccaagtga gttcaagcca acaagtccaa accgaggttg gttcgttcca ccgtttggag 2340 

aaaacccctg gtgggtgtgc cttgctgctg ctatcccggc tttgttggtc actatactga 2400

ttttcatgga ccaacaaatt acagctgtga ttgtaaacag gaaagaacat aaactcaaga 2460 

aaggagcagg gtatcacttg gatctctttt gggtggccat cctcatggtt atatgctccc 2520 

tcatggctct tccgtggtat gtagctgcta cggtcatctc cattgctcac atcgacagtt 2580 

tgaagatgga gacagagact tctgcacctg gagaacaacc aaagtttcta ggagtgaggg 2640 

aacaaagagt cactggaacc cttgtgttta ttctgactgg tctgtcagtc tttatggctc 2700 

ccatcttgaa gtttataccc atgcctgtac tctatggtgt gttcctgtat atgggagtag 2760 

catcccttaa tggtgtgcag ttcatggatc gtctgaagct gcttctgatg cctctgaagc 2820 

atcagcctga cttcatctac ctgcgtcatg ttcctctgcg cagagtccac ctgttcactt 2880 

tcctgcaggt gttgtgtctg gccctgcttt ggatcctcaa gtcaacggtg gctgctatca 2940 

tttttccagt aatgatcttg gcacttgtag ctgtcagaaa aggcatggac tacctcttct 3000 

cccagcatga cctcagcttc ctggatgatg tcattccaga aaaggacaag aaaaagaagg 3060 

aggatgagaa gaaaaagaaa aagaagaagg gaagtctgga cagtgacaat gatgattctg 3120 

actgcccata ctcagaaaaa gttccaagta ttaaaattcc aatggacatc atggaacagc 3180 

aacctttcct aagcgatagc aaaccttctg acagagaaag atcaccaaca ttccttgaac 3240 

gccacacatc atgctgataa aattcctttc cttcagtcac tcggtatgcc aagtcctcct 3300 

agaactccag taaaagttgc ctcaaattag actagaactt gaacctgaag acaatgatta 3360 

tttctggagg agcaagggaa cagaaactac attgtaacct gtttgtcttt cttaaaactg 3420 

acatttgttg ttaatgtcat ttgtttttgt ttggctgttt gtttattttt taacttttat 3480

ttcgtctcag tttttggtca caggccaaat aatacagcgc tctctctgct tctctcttgc 3540

atagatacaa tcaagacaat agtgcaccgt tccttaaaaa cagcatctga ggaatccccc 3600 

ttttgttctt aaactttcag atgtgtcctt tgataaccaa attctgtcac tcaagacaca 3660 

gacacccaca gaccctgtcc tttgcctcta ttaagcagag gatggaagta ttaaggattt 3720 

tgtaacacct tttatgaaaa tgttgaagga acttaaaact ttagctttgg agctgtgctt 3780 

actggcttgt ctttgtctgg tagaacaaac cttgacctcc agacagagtc ccttctcact 3840 

tatagagctc tccaggactg gaaaaagtgc tgctatttta acttgctctt gcttgtaaat 3900 

cctaatctta gagttatcaa aagaagaaaa aactgaaggt actttactcc ctatagagaa 3960 

accattgcca tcattgtagc aagtgctgga atgtcccttt tttcctatgc aactttttta 4020

taacccttta atgaacttat ctgtggagta cattgaagaa tatttttctt cctagatttt 4080

gttgtttaaa ttatggggcc taacctgcca cttatttttt gtcaattttt aaaacttttt 4140

tttaattact gtaaagaaaa tgaatttttt cctgcagcag gaaacatagt tttcagtagt 4200 

tctacctctt atttgtagct gccaggcttt ctgtaaaaat tgtattgtat ataatgtgat 4260 

ttttacacat acatacacac acaaatacac aatctctagg gtaagccaga aggcaagatc 4320 

agattaaaaa caccatgttt ctaagcatcc atttttccct ttctttaaaa gaaacttaac 4380

tgttctatga aggagattga gggagaagag acaaactcct atgtcatgag aataaccgat 4440 

gttctgataa tagtagcatc taggtacaga tgctggttgt attaccacgt caatgtccta 4500 

tgcagtattg ttagacattt tctcattttg aaatatttgt gtgtttgtgt atgtgctctg 4560 

tgccatggct ggtgtatata tgtgcaatgt tagaaggcaa aagagtgatg gtaggcagag 4620 

ggcaaagtca ttgaatctct tatgccagtt ttcataaaac ccaaaccaca tatgaaaaaa 4680

tccattaagg gtccaagaag tctgtccata tgaaaatgag ggtaaatata gtttatttcc 4740 

caggtatcag tcattataat tgatataata gctctaacat gcaatataaa attcatagga 4800 

gtattaatag cccatttaca catctataaa atgtaatggg attgcagagc tgcagagtac 4860 

agtgtaacag tactctcatg caattttttt caggatgcaa aggcaattat tctttgtaag 4920 

cgggacattt agatatattt gtgtacatat tatatgtatg tatatttcaa agtaccacac 4980 

tgaaaattag acatttatta accaaattta acgtggtatt taaaggtaat atttttaata 5040 

tgatacatta catattgtga atgtatacta aaaaaacatt ttaaatgtta aaattataat 5100 

ttcagattca tataaccaca actgtgatat atcctaacta taaccagttg ttgaggggta 5160 

tactagaagc agaatgaaac cacatttttt ggtttgataa tatgcactta ttgactccca 5220 

ctcattgtta tgttaattaa gttattattc tgtctccttg taattttgat tacaaaaatt 5280 

ttattatcct gagttagctg ttacttttac agtacctgat actcctaaaa cttttaactt 5340 

atacaaatta gtcaataatg accccaattt tttcattaaa ataatagtgg tgaattatat 5400 

gttattgtgt taaaacctca cttgccaaat tctggcttca catttgtatt tagggctatc 5460 

cttaaaatga tgagtctata ttatctagct ttctattacc ctaatataaa ctggtataag 5520 



aagactttcc 
agtaaattca 
tctagataaa 
tatacctttc 
taaaacctat 
tagtacctta 
ccagggaagt 
acaagcaaaa 
cccattgact 
aaattacaac 
agataaagga 
ggctgtggtt 
atcaaaagag 
gttttccttg 
gttttgtgtg 
gggctgggga 
ccttccacct 
gtattttatg 
gtttaataga 
acttgaaatg 
attacagagg 
acttaatgat 
atttgctctt 
gtgatatttt 
gcttcattgc 
ttttggccag 
acgtataaaa 
tccagtgaaa 
gaatagttgc 
cagaatcaat 
tggagagaga 
aatgataaag 
taaaattcct 



ttttttcttt 



gaattaagag 
tacttaaaaa 



gcttaacccg 
actttctttt 
aaagaactcc 
gtgtacttct 
catttgctaa 
ttttaaaaat 
cctgaggttt 
aagaatgaac 
agtaaatttt 
ttcaaaagat 
gcagaaacac 
caaaaatgcc 
ataaaagtca 
ccaggggcag 
gaggacttaa 
ccaagagagg 
ctcatgcacc 
gaatgattgt 
tattgactgc 
ttgacattgt 
tttggtgcag 
cacagcatga 
gtagcacctc 
agcctgaaat 
agatttagat 
agcaagaatt 
cttttagagc 
ctctttttag 
acgcactttc 
aatgatgcat 
aacagaaaac 
ataacatagt 
atgaagaaga 
taaagtaact 



atgcatggaa 
tgatcttctt 
ctgatcatga 
tgaagcaact 
gtttatgatc 
acagacagtt 
gtcattactt 
acctagtgac 
cggatttggt 
tttataagtg 
aaagacaaag 
tgttggaaga 
tctggttttg 
tcttctctac 
taatgtccct 
ctgacttaag 
aggaaaaaca 
aacccataca 
cttccgagtt 
tgactatatt 
tcagggatgg 
gaacctgaga 
gcatgaagcc 
actatctgaa 
tttaaaaaag 
atgtccaatt 
ctgtttccta 
cagaagtaac 
ccattttcat 
agatattctt 
ctttttattt 
atttcaatat 
attttacagt 
tgagatattt 
ttatgc 



gcatcaataa 
aagttaaagg 
atcaagtatc 
taactcatca 
catattgata 
aatatcaaat 
tcaaaggaac 
accaaattat 
ggtggttttg 
cttctccctt 
tacgctcaga 
aaagagatga 
tgtttgggag 
tgtgtgaaat 
gcctcttctc 
aagtaggaaa 
gttgtctgct 
gagtaaatct 
ttttggttcc 
ttaaaaacaa 
gacaaagcct 
ttttctgatt 
cagtggcacc 
aggccatgag 
ttgtgtaata 
tcctggctca 
ggcaaacatt 
tttgtcccag 
ggctggtaca 
atttttattc 
taaatgaagt 
attactaata 
tttatgaagc 
tagcatttat 



attgtttaaa 
tacttttgtt 
tgtggtttca 
tttcagcctc 
tttatgacat 
cctttcaata 
agaagtagtt 
cggtatttta 
tttctatgca 
ctccatgatg 
gttgttaacc 
ctaagtcaag 
aaaaatatct 
gaatacttgg 
ccaatcaagg 
acaaaaacct 
gtctgtaatt 
tttatcaact 
ttttttaact 
aatgaaataa 
tcttcaatcc 
tatatttcat 
aaatggctgg 
ttttcagatg 
cgccaaccag 
tttcattgtg 
gcaactcagg 
tcctacaatg 
tattcgtacg 
tcttaagtct 
agatcaacat 
actttttcca 
tttctattgt 
atttttcaaa 



aaccatgtat 
ttataaaagc 
tgcccctctc 
tgagtagagg 
gaacacagaa 
ttctgggaac 
aaccaaacta 
actgaattta 
aactggacac 
tgacttccgg 
agaaagtcct 
tgtctgcctt 
tggacgcact 
aattctaatt 
ttgaggagtg 
ctctcctcag 
cagtttgcgt 
atatactggt 
gtgttaaagt 
tttgagttgt 
ttttcatact 
gatatttcac 
gtacaatcaa 
atttcattga 
tcaagttgtg 
ctctatgggt 
gctaaagtca 
tgaaaagagt 
cattactttt 
ttattaactt 
ggtggaacaa 
atataaatcc 
gacttttatg 
attatatgta 



5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7586 



Human BRCA1-associated RING domain protein (BARD1) mRNA, complete cds.

/translation="MPDNRQPRNRQPRIRSGNEPRSAPAMEPDGRGAWAHSRAALDRLE
KLIACSRCTNILREPVCLGGCEHIFCSNC^^
IQLCSKLRNTLLHDNELSDLKEDK^
QTQPAIKKDASAQQDSYEFVSPSPPADVSERAKKASARSGKKQK3OT
EKEDGEFDSKEESKQKLVSFCSQPSVISSPQINGEIDLLASGSLTESECFGSLTEVSLP
LAEQIESPDTKSRNEVVTPEKVCKNYLTSKKSLPLEm
LSTSGDFVKQTVPSENIPLPECSSPPSCKRKVGGTSGRKNSNMSDEFISLSPGTPPSTL
SSSSYRQVMSSPSAMKLLPNMAVKRNHRGETLLHIASIKGDIPSVEYLLQNGSDPNVKD
HAGWTPLHEACHiniGHLKVVELLLQHKALVNTTGYQNDSPLHDAAKNGHVDIVKLLLSYG
ASRNAVNIFGLRPVDYTDDESMKSLLLLPEKNESSSASHCSVMNTGQRRDGPLVLIGSG
^SSEQQKIYn^SEIAVILKAKKYTEFDSTV^
VKACLRRKVCEQEEKYEIPEGPRRSRLNREQLLPKLFDGCYFYLWGTFKHHPKD
VTAGGGQILSRKPKPDSDVTQTINTVAYHARPDSDQRFCTQYIIYEDLCNYHPERVRQG
KVWKAPSSWFIDCVMSFELLPLDS"

Sequence 2530 BP; 762 A; 522 C; 587 G; 659 T; 0 other;



tctgcgagga 


gcctttcatc 


60 


cggcagccga 


ggatccgctc 


120 


ggtcgcggtg 


cctgggccca 


180 


tgctcgcgtt 


gtactaacat 


240 


ttctgtagta 


attgtgtaag 


300 


gcctggatac 


aagacttgaa 


360 


aagcttcgaa 


atttgctaca 


420 


aaaagtttgt 


ttaatgatgc 


480 


cgaagtaaga 


aagtcagata 


540 


aaaaaagatg 


caagtgctca 


600 


gatgtttctg 


agagggctaa 


660 


actttagctg 


aaatcaacca 


720 


gactccaaag 


aggaatctaa 


780 


tccagtcctc 


agataaatgg 


840 


gaatgttttg 


gaagtttaac 


900 


gacactaaga 


gcaggaatga 


960 


tctaagaaat 


ctttgccatt 


1020 


agtcccattt 


ctaagagatg 


1080 


caaaccgtgc 


cctcagaaaa 


1140 


cgtaaagttg 


gtggtacatc 


1200 


ctttcaccag 


gtacaccacc 


1260 


agtccctcag 


caatgaagct 


1320 


actttgctcc 


atattgcttc 


1380 


aatggaagtg 


atccaaatgt 


1440 


aatcatgggc 


acctgaaggt 


1500 


accgggtatc 


aaaatgactc 


1560 


gtcaagctgt 


tactttccta 


1620 


cctgtcgatt 


atacagatga 


1680 


gaatcatcct 


cagctagcca 


1740 


cttgtactta 


taggcagtgg 


1800 


gtaattctta 


aggctaaaaa 


1860 


cctggtgatg 


cagttcaaag 


1920 


attctaaaat 


ttgaatgggt 


1980 


aagtatgaaa 


ttcctgaagg 


2040 


aagctgtttg 


atggatgcta 


2100 


aaccttatta 


agctcgtcac 


2160 


gacagtgacg 


tgactcagac 


2220 


cagcgcttct 


gcacacagta 


2280 


gttcggcagg 


gcaaagtctg 


2340 



gaaggctcct tcgagctggt ttatagactg tgtgatgtcc tttgagttgc ttcctcttga 2400 

cagctgaata ttataccaga tgaacatttc aaattgaatt tgcacggttt gtgagagccc 2460 

agtcattgta ctgtttttaa tgttcacatt tttacaaata ggtagagtca ttcatatttg 2520 

tctttgaatc 2530






Human 18S rRNA gene, complete.

ccgtccgtcc gtcgtcctcc tcgcttgcgg ggcgccgggc ccgtcctcga gcccccnnnn 60 

nccgtccggc cgcgtcgggg cctcgccgcg ctctacctac ctacctggtt gatcctgcca 120

gtagcatatg cttgtctcaa agattaagcc atgcatgtct aagtacgcac ggccggtaca 180 

gtgaaactgc gaatggctca ttaaatcagt tatggttcct ttggtcgctc gctcctctcc 240

tacttggata actgtggtaa ttctagagct aatacatgcc gacgggcgct gacccccttc 300

gcggggggga tgcgtgcatt tatcagatca aaaccaaccc ggtcagcccc tctccggccc 360

cggccggggg gcgggccgcg gcggctttgg tgactctaga taacctcggg ccgatcgcac 420 

gccccccgtg gcggcgacga cccattcgaa cgtctgccct atcaactttc gatggtagtc 480 

gccgtgccta ccatggtgac cacgggtgac ggggaatcag ggttcgattc cggagaggga 540

gcctgagaaa cggctaccac atccaaggaa ggcagcaggc gcgcaaatta cccactcccg 600 

acccggggag gtagtgacga aaaataacaa tacaggactc tttcgaggcc ctgtaattgg 660 

aatgagtcca ctttaaatcc tttaacgagg atccattgga gggcaagtct ggtgccagca 720

gccgcggtaa ttccagctcc aatagcgtat attaaagttg ctgcagttaa aaagctcgta 780 

gttggatctt gggagcgggc gggcggtccg ccgcgaggcg agccaccgcc cgtccccgcc 840

ccttgcctct cggcgccccc tcgatgctct tagctgagtg tcccgcgggg cccgaagcgt 900 

ttactttgaa aaaattagag tgttcaaagc aggcccgagc cgcctggata ccgcagctag 960 

gaataatgga ataggaccgc ggttctattt tgttggtttt cggaactgag gccatgatta 1020 

agagggacgg ccgggggcat tcgtattgcg ccgctagagg tgaaattctt ggaccggcgc 1080 

aagacggacc agagcgaaag catttgccaa gaatgttttc attaatcaag aacgaaagtc 1140 

ggaggttcga agacgatcag ataccgtcgt agttccgacc ataaacgatg ccgaccggcg 1200

atgcggcggc gttattccca tgacccgccg ggcagcttcc gggaaaccaa agtctttggg 1260

ttccgggggg agtatggttg caaagctgaa acttaaagga attgacggaa gggcaccacc 1320 

aggagtggag cctgcggctt aatttgactc aacacgggaa acctcacccg gcccggacac 1380 

ggacaggatt gacagattga tagctctttc tcgattccgt gggtggtggt gcatggccgt 1440 

tcttagttgg tggagcgatt tgtctggtta attccgataa cgaacgagac tctggcatgc 1500 

taactagtta cgcgaccccc gagcggtcgg cgtcccccaa cttcttagag ggacaagtgg 1560 

cgttcagcca cccgagattg agcaataaca ggtctgtgat gcccttagat gtccggggct 1620 

gcacgcgcgc tacactgact ggctcagcgt gtgcctaccc tacgccggca ggcgcgggta 1680 

acccgttgaa ccccattcgt gatggggatc ggggattgca attattcccc atgaacgagg 1740 

aattcccagt aagtgcgggt cataagcttg cgttgattaa gtccctgccc tttgtacaca 1800 

ccgcccgtcg ctactaccga ttggatggtt tagtgaggcc ctcggatcgg ccccgccggg 1860 

gtcggcccac ggcctggcgg agcgctgaga agacggtcga acttgactat ctagaggaag 1920 

taaaagtcgt aacaaggttt ccgtaggtga acctgcggaa ggatcatta 1969 




Human mRNA for 56-KDa protein induced by interferon 

/translation="MSTNGDDHQVKDSLEQLRCHFTWELSIDDDEMPDLENRVLDQIEF
LDTKYSVGIHNLLAYVKHLKGQNEEALKSLKEAENLMQEEHDNQANVRSLVTWGNFAWM
YYHMGRLAEAQTYLDKVENICKKLSNPFRYRMECPEIDCEEGWALLKCGGKNYERAKAC
FEKVLEVDPENPESSAGYAISAYRLDGFKLATKNHKPFSLLPLRQAVRLNPDNGYIKVL
LALKLQDEGQEAEGEKYIEEALANMSSQTYVFRYAAKFYRRKGSVDKALELLKKALQET
PTSVLLHHQIGLCYKAQMIQIKEATKGQPRGQNREKLDKMIRSAIFHFESAVEKKPTFE
VAHLDLARMYIEAGNHRKAEENFQKLLCMKPVVEETMQDIHFYYGRFQEFQKKSDVNAI
IHYLKAIKIEQASLTRDKSINSLKKLVLRKLRRKALDLESLSLLGFVYKLEGNMNEALE
YYERALRLAADFENSVRQGP"

Sequence 1642 BP; 551 A; 318 C; 369 G; 404 T; 0 other; 
ccagatctca gaggagcctg gctaagcaaa accctgcaga acggctgcct aatttacagc 60
aaccatgagt acaaatggtg atgatcatca ggtcaaggat agtctggagc aattgagatg 120
tcactttaca tgggagttat ccattgatga cgatgaaatg cctgatttag aaaacagagt 180
cttggatcag attgaattcc tagacaccaa atacagtgtg ggaatacaca acctactagc 240
ctatgtgaaa cacctgaaag gccagaatga ggaagccctg aagagcttaa aagaagctga 300
aaacttaatg caggaagaac atgacaacca agcaaatgtg aggagtctgg tgacctgggg 360
caactttgcc tggatgtatt accacatggg cagactggca gaagcccaga cttacctgga 420
caaggtggag aacatttgca agaagctttc aaatcccttc cgctatagaa tggagtgtcc 480
agaaatagac tgtgaggaag gatgggcctt gctgaagtgt ggaggaaaga attatgaacg 540
ggccaaggcc tgctttgaaa aggtgcttga agtggaccct gaaaaccctg aatccagcgc 600
tgggtatgcg atctctgcct atcgcctgga tggctttaaa ttagccacaa aaaatcacaa 660
gccattttct ttgcttcccc taaggcaggc tgtccgctta aatccagaca atggatatat 720
taaggttctc cttgccctga agcttcagga tgaaggacag gaagctgaag gagaaaagta 780
cattgaagaa gctctagcca acatgtcctc acagacctat gtctttcgat atgcagccaa 840
gttttaccga agaaaaggct ctgtggataa agctcttgag ttattaaaaa aggccttgca 900
ggaaacaccc acttctgtct tactgcatca ccagataggg ctttgctaca aggcacaaat 960
gatccaaatc aaggaggcta caaaagggca gcctagaggg cagaacagag aaaagctaga 1020
caaaatgata agatcagcca tatttcattt tgaatctgca gtggaaaaaa agcccacatt 1080
tgaggtggct catctagacc tggcaagaat gtatatagaa gcaggcaatc acagaaaagc 1140
tgaagagaat tttcaaaaat tgttatgcat gaaaccagtg gtagaagaaa caatgcaaga 1200
catacatttc tactatggtc ggtttcagga atttcaaaag aaatctgacg tcaatgcaat 1260
tatccattat ttaaaagcta taaaaataga acaggcatca ttaacaaggg ataaaagtat 1320
caattctttg aagaaattgg ttttaaggaa acttcggaga aaggcattag atctggaaag 1380
cttgagcctc cttgggttcg tctacaaatt ggaaggaaat atgaatgaag ccctggagta 1440
ctatgagcgg gccctgagac tggctgctga ctttgagaac tctgtgagac aaggtcctta 1500
ggcacccaga tatcagccac tttcacattt catttcattt tatgctaaca tttactaatc 1560
atcttttctg cttactgttt tcagaaacat tataattcac tgtaatgatg taattcttga 1620
ataataaatc tgacaaaata tt 1642



qx82h04.x1 NCI_CGAP_GC6 Homo sapiens cDNA clone IMAGE:2009047 3', mRNA
sequence.



gcagctaaat taaaatgacc ttttatttgc ctggacaaca aaaattttcc atgattttgc 60 

ttttttgaaa caatgataag aaattttttt ttaggcaata agatactaag ttgtatcaac 120

aaactgcatg ggatatttcc acaaggagag gattttgttc cctgatctag tttacgtgac 180 

attttccctt atgcttgctt tctctgagct gactcttctt aaactgacct agatggtacc 240

ctatttcaac tgactcagag ttcattcaaa aatatgatat ggtgacttgg cttcactgac 300

atgaaatcca ggcactctct ctactcttgc tcacattctt ccttgcccaa ggttccagcg 360

tgattttagg atatcttatg ccaacccagt gtgccgtcac ttctcagaga tgtagggcca 420 



Human interferon-induced cellular resistance mediator protein (MxA) mRNA,
complete cds.



/translation="MVVSEVDIAKADPAAASHPLLLNGDATVAQKNPGSVAENNLCSQY
EEKVRPCIDLIDSLRALGVEQDLALPAIAVIGDQSSGKSSVLEALSGVALPRGSGIVTR
CPLVLKLKKLVNEDKWRGKVSYQDYEIEISDASEVEKEINKAQNAIAGEGMGISHELIT
REISSRDVPDLTLIDLPGITRVAVGNQPADIGYKIKTLIKKYIQRQETISLVVVPSNVD
IATTEALSMAQEVDPEGDRTIGILTKPDLVDKGTEDKVVDVVRNLVFHLKKGYMIVKCR
GQQEIQDQLSLSEALQREKIFFENHPYFRDLLEEGKATVPCLAEKLTSELITHICKSLP
LLENQIKETHQRITEELQKYGVDIPEDENEKMFFLIDKINAFNQDITALMQGEETVGEE
DIRLFTRLRHEFHKWSTIIENNFQEGHKILSRKIQKFENQYRGRELPGFVNYRTFETIV
KQQIKALEEPAVDMLHTVTDMVRLAFTDVSIKNFEEFFNLHRTAKSKIEDIRAEQEREG
EKLIRLHFQMEQIVYCQDQVYRGALQKVREKELEEEKKKKSWDFGAFQSSSATDSSMEE
IFQHLMAYHQEASKRISSHIPLIIQFFMLQTYGQQLQKAMLQLLQDKDTYSWLLKERSD
TSDKRKFLKERLARLTQARRRLAQFPG"
Sequence 2651 BP; 732 A; 646 C; 704 G; 569 T; 0 other; 

ggaattctgt ggccatactg cgaggagatc ggttccgggt cggaggctac aggaagactc 60 

ccactccctg aaatctggag tgaagaacgc cgccatccag ccaccattcc aaggaggtgc 120 

aggagaacag ctctgtgata ccatttaact tgttgacatt acttttattt gaaggaacgt 180 

atattagagc ttactttgca aagaaggaag atggttgttt ccgaagtgga catcgcaaaa 240 

gctgatccag ctgctgcatc ccaccctcta ttactgaatg gagatgctac tgtggcccag 300 

aaaaatccag gctcggtggc cgagaacaac ctgtgcagcc agtatgagga gaaggtgcgc 360 

ccctgcatcg acctcattga ctccctgcgg gctctaggtg tggagcagga cctggccctg 420 

ccagccatcg ccgtcatcgg ggaccagagc tcgggcaaga gctccgtgtt ggaggcactg 480

tcaggagttg cccttcccag aggcagcggg atcgtgacca gatgcccgct ggtgctgaaa 540 

ctgaagaaac ttgtgaacga agataagtgg agaggcaagg tcagttacca ggactacgag 600 

attgagattt cggatgcttc agaggtagaa aaggaaatta ataaagccca gaatgccatc 660 

gccggggaag gaatgggaat cagtcatgag ctaatcaccc gtgagatcag ctcccgagat 720 

gtcccggatc tgactctaat agaccttcct ggcataacca gagtggctgt gggcaatcag 780 

cctgctgaca ttgggtataa gatcaagaca ctcatcaaga agtacatcca gaggcaggag 840 

acaatcagcc tggtggtggt ccccagtaat gtggacattg ccaccacaga ggctctcagc 900 

atggcccagg aggtggaccc cgagggagac aggaccatcg gaatcttgac gaagcctgat 960 

ctggtggaca aaggaactga agacaaggtt gtggacgtgg tgcggaacct cgtgttccac 1020 

ctgaagaagg gttacatgat tgtcaagtgc cggggccagc aggagatcca ggaccagctg 1080 

agcctgtccg aagccctgca gagagagaag atcttctttg agaaccaccc atatttcagg 1140 

gatctgctgg aggaaggaaa ggccacggtt ccctgcctgg cagaaaaact taccagcgag 1200 

ctcatcacac atatctgtaa atctctgccc ctgttagaaa atcaaatcaa ggagactcac 1260 

cagagaataa cagaggagct acaaaagtat ggtgtcgaca taccggaaga cgaaaatgaa 1320 

aaaatgttct tcctgataga taaaattaat gcctttaatc aggacatcac tgctctcatg 1380 

caaggagagg aaactgtagg ggaggaagac attcggctgt ttaccagact ccgacacgag 1440 

ttccacaaat ggagtacaat aattgaaaac aattttcaag aaggccataa aattttgagt 1500 

agaaaaatcc agaaatttga aaatcagtat cgtggtagag agctgccagg ctttgtgaat 1560 

tacaggacat ttgagacaat cgtgaaacag caaatcaagg cactggaaga gccggctgtg 1620 

gatatgctac acaccgtgac ggatatggtc cggcttgctt tcacagatgt ttcgataaaa 1680 

aattttgaag agttttttaa cctccacaga accgccaagt ccaaaattga agacattaga 1740 

gcagaacaag agagagaagg tgagaagctg atccgcctcc acttccagat ggaacagatt 1800 

gtctactgcc aggaccaggt atacaggggt gcattgcaga aggtcagaga gaaggagctg 1860 

gaagaagaaa agaagaagaa atcctgggat tttggggctt tccaatccag ctcggcaaca 1920 

gactcttcca tggaggagat ctttcagcac ctgatggcct atcaccagga ggccagcaag 1980 

cgcatctcca gccacatccc tttgatcatc cagttcttca tgctccagac gtacggccag 2040 

cagcttcaga aggccatgct gcagctcctg caggacaagg acacctacag ctggctcctg 2100 

aaggagcgga gcgacaccag cgacaagcgg aagttcctga aggagcggct tgcacggctg 2160 

acgcaggctc ggcgccggct tgcccagttc cccggttaac cacactctgt ccagccccgt 2220 

agacgtgcac gcacactgtc tgcccccgtt cccgggtagc cactggactg acgacttgag 2280 

tgctcagtag tcagactgga tagtccgttc ctgcttatcc gttagccgtg gtgatttagc 2340 

aggaagctgt gagagcagtt tggtttctag catgaagaca gagccccacc ctcagatgca 2400 



catgagctgg cgggattgaa ggatgctgtc ttcgtactgg gaaagggatt ttcagccctc 2460 

agaatcgctc caccttgcag ctctcccctt ctctgtattc ctagaaactg acacatgctg 2520 

aacatcacag cttatttcct catttttata atgtcccttc acaaacccag tgttttagga 2580 

gcatgagtgc cgtgtgtgtg cgtcctgtcg gagccctgtc tctctctctg taataaactc 2640

atttctagca g 2651 



Homo sapiens cDNA: FLJ21726 fis, clone COLF1088.

agtgcatgga gacgagaggt gtttctaaag atgggagaaa tgacagcgtg catgtgtgcc 60 

gatgggagtc accccataga gaaggaagaa agcagtgaca gaggagagga ctgctccttg 120 

tccttgagta gttggccaag ggagagacct cctgcacaaa tggagggttt ggcctcacgc 180 

agaaagaagc acacttggtt catccctggc agcaggaggg aaggcgtggg tgtagggaac 240 

agggcgtgtg gaggggatct tttgggtgct cttattttct cagtgaaata caggacgcaa 300 

gagcagcagt ggacggtgag aatggggatg ttcccatcca gctttcaggg tcccatgtga 360 

tagtgccccg tggctggcct gtgttctggg gacagtcact ggccacatgc actgcagggc 420 

atcaggcagc agaggctgcc ttgggcagga cagagacagg cccgccaact aatgtgcccc 480 

tttttgcctc tgcctccagg actgtccaac ccattccggg gtctcatgaa gctgggcacc 540 

gtggagcggc ggggggcaat gggcatctgg aaggagctct tctgcgagct ctccccgctg 600 

gagttccgcc tctacctgag caacgaggag cacacctgtg tggagaactg ctcgctgctt 660 

cgctgtgagt ctgtggggcc agcccatagt gatgggcgct ttgagctggt cttctctggc 720 

aagaagctgg ccctgcgcgc ctcctcccag gacgaagctg aggactggct ggaccgggtg 780 

cgggaggccc tgcagaaggt ccggcctcag caggaggatg agtgggtgaa cgtgcagtac 840 

ccagaccagc ctgaggaacc ccccgaggcg ccccagggct gcctctctcc ctcagacctg 900 

ctctcggagc ccgcggccct ccagggcaca cagtttgact ggtcgtccgc ccaggttcca 960 

gagccagatg ccatcaagga gtccctgctg tacttgtaca tggacaggac ctggatgccc 1020 

tatatatttt ctctgtcctt ggaggctctg aaatgtttcc gcatcaggaa caatgagaag 1080 

atgctgagtg acagccacgg cgtggagacc atccgggaca tcctgccaga caccagcctt 1140

gggggcccat ccttcttcaa aatcatcacg gccaaggctg tcctgaagct gcaggccgga 1200 

aacgccgagg aagccgccct gtggagggat ctggtccgca aagtcctggc atcctacttg 1260 

gagacagccg aggaggcggt gaccctgggc gggagcctgg atgaaaactg tcaggaggtg 1320

ctgaaatttg ccacccggga gaatggcttc ctgctgcagt acctggtggc tatccccatg 1380 

gagaaaggcc ttgactccca aggctgcttc tgcgcaggtg ccgatttgct ctgctgccac 1440 

ccccagcctg ccagcctcac tccacctcct gctggttcct gatttaggct ccccaccctt 1500 

ctgcctcccc gcaaatgccc ccatccttcc cctagggatg aggccacaga tcaggcttgc 1560 

cctacagctt ctgctcctcc ccagccctgg ctggggccag tgccctgctc ataggcagtg 1620

ggccctgctc acccgtccct ctcctgccac ctcccactga tgggcggcag gctggctact 1680 

cactgcgctg ctcagggagt cccagcctgc ttcattttct tcttgctcta ccgtcctgtt 1740 

ctttcagagc aggggcatgg tttccttcca aatatttctg ctgcttttat aagtgtacac 1800 

cctttttttt aattataaaa atgggctcgt gctaaaaaaa aaaaaaaaaa aaaaaaaaa 1859 



xw86e11.x1 NCI_CGAP_Pan1 Homo sapiens cDNA clone IMAGE:2834924 3', mRNA
sequence.



ttataagaaa tttatttttt cacagataca gaacataaat ccaagaaaaa ttattattat 60 

ttttcacaat tatgactaaa tcatgttatt tctagttatt tacaagtact acaatgttct 120 

atgcatttct tcatcctaga cattaataaa acacatccct ttggtcttag atacttctct 180 

ttggtctgtg ttttctcctt tctgaatttt aatcttctgt gatgtgagga aatttacgtg 240

aacctttcac atatctattt ttttccttgt gcacagttga taatttcctc ccttagattc 300

cctgagaaaa gaaacacaaa atattcttag tggattatct caggaaaggc aaccagaggg 360 

aagaggaata ttggaccact gaaaatctca accaacgcta atattaggag cacacgtacc 420 

atgaggaaga gaagggatgg ggaaaccaag atggcagagt tagagcaaca aagttagtaa 480 
catgagagtt tcccagcaat ttgagtaaga - 



Human 71 kDa 2'5' oligoadenylate synthetase (p69 2-5A synthetase) mRNA,
complete cds. 

/translation="MGNGESQLSSVPAQKLGWFIQEYLKPYEECQTLIDEKVTNTICDVC

RNPEQFPLVQGVAIGGSYGRKTVLRGNSDGTLVLFFSDLKQFQDQKRSQRDILDKTGDK

LKFCLFTKWLKNNFEIQKSLDGSTIQVFTKNQRISFEVI^ 

KRSLDKTNASPGEFAVCFTELQQKFFDNRPGKLKDLILLIKHW^

YALELLTVYAWEQGCRKDNFDIAEGVRTVLELIKC^^

HQLQSARPVILDPVDPTNNVSGDKICWQWLKKEAQTWLTSPNLD

FTTPGHLLDKFIKEFLQPNKCFLEQIDSAVNIIRTFLKENCFRQSTAKIQIVRGGSTAK
GTALKTGSDADLVVFHNSLKSYTSQKNERHKIVKEIHEQLKAFWREKEEELEVSFEPPK
WKAPRVLSFSLKSKVLNESVSFDVLPAFNALGQLSSGSTPSPEVYAGLIDLYKSSDLPG
GEFSTCFTVLQRNFIRSRPTKLKDLIRLV^

WEQGSGVPDFDTAEGFRTVLELVTQYQQLGIFWKVNYNFEDETVRKFLLSQLQKTRPVI
LDPGEPTGDVGGGDRWCWHLLDKEAKVRL^
IHPIVNEMFSSRSHRILNNNSKRNFWRSSGNRF"

cggcagccag ctgagagcaa tgggaaatgg ggagtcccag ctgtcctcgg tgcctgctca 60 

gaagctgggt tggtttatcc aggaatacct gaagccctac gaagaatgtc agacactgat 120 

cgacgagatg gtgaacacca tctgtgacgt ctgcaggaac cccgaacagt tccccctggt 180 

gcagggagtg gccataggtg gctcctatgg acggaaaaca gtcttaagag gcaactccga 240 

tggtaccctt gtccttttct tcagtgactt aaaacaattc caggatcaga agagaagcca 300 

acgtgacatc ctcgataaaa ctggggataa gctgaagttc tgtctgttca cgaagtggtt 360 

gaaaaacaat ttcgagatcc agaagtccct tgatgggtcc accatccagg tgttcacaaa 420

aaatcagaga atctctttcg aggtgctggc cgccttcaac gctctgagct taaatgataa 480 

tcccagcccc tggatctatc gagagctcaa aagatccttg gataagacaa atgccagtcc 540 

tggtgagttt gcagtctgct tcactgaact ccagcagaag ttttttgaca accgtcctgg 600 

aaaactaaag gatttgatcc tcttgataaa gcactggcat caacagtgcc agaaaaaaat 660 

caaggattta ccctcgctgt ctccgtatgc cctggagctg cttacggtgt atgcctggga 720 

acaggggtgc agaaaagaca actttgacat tgctgaaggc gtcagaacgg ttctggagct 780 

gatcaaatgc caggagaagc tgtgtatcta ttggatggtc aactacaact ttgaagatga 840 

gaccatcagg aacatcctgc tgcaccagct ccaatcagcg aggccagtaa tcttggatcc 900 

agttgaccca accaataatg tgagtggaga taaaatatgc tggcaatggc tgaaaaaaga 960 

agctcaaacc tggttgactt ctcccaacct ggataatgag ttacctgcac catcttggaa 1020 

tgtcctgcct gcaccactct tcacgacccc aggccacctt ctggataagt tcatcaagga 1080 

gtttctccag cccaacaaat gcttcctaga gcagattgac agtgctgtta acatcatccg 1140 

tacattcctt aaagaaaact gcttccgaca atcaacagcc aagatccaga ttgtccgggg 1200 

aggatcaacc gccaaaggca cagctctgaa gactggctct gatgccgatc tcgtcgtgtt 1260 

ccataactca cttaaaagct acacctccca aaaaaacgag cggcacaaaa tcgtcaagga 1320 

aatccatgaa cagctgaaag ccttttggag ggagaaggag gaggagcttg aagtcagctt 1380 

tgagcctccc aagtggaagg ctcccagggt gctgagcttc tctctgaaat ccaaagtcct 1440 

caacgaaagt gtcagctttg atgtgcttcc tgcctttaat gcactgggtc agctgagttc 1500 

tggctccaca cccagccccg aggtttatgc agggctcatt gatctgtata aatcctcgga 1560 

cctcccggga ggagagtttt ctacctgttt cacagtcctg cagcgaaact tcattcgctc 1620 

ccggcccacc aaactaaagg atttaattcg cctggtgaag cactggtaca aagagtgtga 1680 

aaggaaactg aagccaaagg ggtctttgcc cccaaagtat gccttggagc tgctcaccat 1740

ctatgcctgg gagcagggga gtggagtgcc ggattttgac actgcagaag gtttccggac 1800 

agtcctggag ctggtcacac aatatcagca gctcggcatc ttctggaagg tcaattacaa 1860 

ctttgaagat gagaccgtga ggaagtttct actgagccag ttgcagaaaa ccaggcctgt 1920 

gatcttggac ccaggcgaac ccacaggtga cgtgggtgga ggggaccgtt ggtgttggca 1980 

tcttctggac aaagaagcaa aggttaggtt atcctctccc tgcttcaagg atgggactgg 2040 

aaacccaata ccaccttgga aagtgccgac aatgcagaca ccaggaagtt gtggagctag 2100 

gatccatcct attgtcaatg agatgttctc atccagaagc catagaatcc tgaataataa 2160 

ttctaaaaga aacttctgga gatcatctgg caatcgcttt taaagactcg gctcaccgtg 2220 

agaaagagtc actcacatcc attcttccct tgatggtccc tattcctcct tcccttgcct 2280 

tcttggactt cttgaaatca atcaagactg caaacccttt cataaagctg ccttgctgaa 2340 

ctcctctctg caggagccct gcttaaaata gttgatgtca tcactttatg tgcatcttat 2400 



ttctgtcaac ttgtattttt ttttcttgta tttttccaat tagctcctcc tttttccttc 2460 

cagtctaaaa aaggaatcct ctgtgtcttc aaagcaaagc tctttacttt ccccttggtt 2520

ctcataactc tgtgatcttg ctctcggtgc ttccaactca tccacgtcct gtctgtttcc 2580 

tctgtataca aaaccctttc tgcccctgct gacacagaca tcctctatgc cagcagccag 2640

gccaaccctt tcattagaac ttcaagctct ccaaaggctc agattataac tgttgtcata 2700 

tttatatgag gctgttgtct tttccttctg agcctgcctt tatcccccca cccaggagta 2760 

tcctcttgcc aaagcaaaag actttttcct tggctttagc cttaaagata cttgaaggtc 2820

taggtgcttt aacctcacat accctcactt aaacttttat cactgttgca tataccagtt 2880 

gtgatacaat aaagaatgta tctgg 2905



Homo sapiens cDNA FLJ20035 fis, clone COL00213. 

/translation="MSGRAGRRGQDLMGDVYFFDIPFPKIGKLIKSNVPELRGHFPLSI
TLVLRLMLLASKGDDPEDAKAKV^ 

LDQEGNPMGFAGLVSHLHYHEPSNLVFVSFLVNGLFHDLCQPTR^

LVIAHLFGRRYFPPKFQDAHFEFYQSKVFLDDLPEDFSDALDEYNMKIMEDFTTFLRIV
SKLADMNQEYQLPLSKIKFTGKECEDSQLVSHLMSCKEGRVAISPFVCLSGNFDDDLLR
LETPNHVTLGTIGVNRSQAPVLLSQKFDNRGRKMSLNAYALDFYKHGSLIGLVQ
EGDAYYLLKDFALTIKSISVSLRELCENEDDNVVLAFEQLSTTFWEKLNKV"

Sequence 1906 BP; 626 A; 327 C; 359 G; 594 T; 0 other; 

aatctgtggt ttttgctcaa aactcagtct atctggatgc gttgaattat agacagatgt 60 

ctggccgtgc tggaagaaga ggtcaagacc tgatgggaga tgtatatttc tttgatattc 120 

cattccccaa aataggaaaa ctcataaaat ccaatgttcc tgagctgaga ggacacttcc 180 

ctctcagcat aaccctggtc ctgcgactca tgctgctggc ttccaaggga gatgacccag 240 

aggatgccaa ggcaaaggtg ctatcagtgc taaagcattc attgctgtcc ttcaagcaac 300

ccagagtcat ggacatgtta aaactttact tcctgttttc tttgcagttc ctggtgaaag 360 

agggctattt agatcaagaa ggtaatccta tggggtttgc tggacttgtg tcacatttgc 420 

attatcatga accttctaat cttgtttttg tgagttttct tgtaaatggc ctcttccatg 480 

atctctgtca gccaaccagg aaaggctcaa aacatttttc tcaagacgtt atggaaaagc 540 

tagtattagt attggcacat ctctttggaa gaagatattt tccaccaaag ttccaggatg 600 

cacacttcga gttttatcaa tcaaaggtgt tccttgatga tctccctgag gattttagtg 660 
atgctttaga tgaatataac atgaaaatta tggaggactt taccactttc ctacgaattg 720 
tttccaaact ggctgatatg aatcaggaat atcaactccc attgtcaaaa atcaaattca 780 
caggtaaaga atgtgaagac tctcaactcg tatctcattt gatgagctgc aaggaaggaa 840 
gagtagcaat ttcaccattt gtttgtctgt ctgggaactt tgatgatgat ttgcttcgac 900 
tagaaactcc aaaccatgtt actctaggca caatcggtgt caatcgctct caggctccag 960 

tgctgttgtc acagaaattt gataaccgag gaaggaaaat gtcgcttaat gcctatgcac 1020 

tggatttcta caaacatggt tccttgatag gattagtcca ggataacagg atgaatgaag 1080 

gagatgctta ttatttgttg aaggattttg cactcaccat taaatctatc agtgtttcct 1140 

tgcgtgagct atgtgaaaat gaagacgaca acgttgtctt agcctttgaa caactgagta 1200 

caactttttg ggaaaagtta aacaaagtct aaaaacaaag tctatgcaaa ccactcaaaa 1260 

ataattccat agtagttttt caggtcacgt ttttgattct tatgcttctt gccagaaata 1320 

cattatgata aagtggaaat acattacgat gaagtggaaa gagcaaacac tttggaatca 1380 

aacagagttg caatcaaacc tgccatgttc tgtcatgaat actcacaaat tatttagtat 1440 

acctgaatct tggtttcttt ttataactga gtaataatgg ttacatctca ggtagtttga 1500 

ggattgacta aaaaaatgcg agaatgttgt atgtgactga ataacaattt ttactctgcg 1560 

aagccaaagt aaatataata ttatcagtaa ctttatcccc agtgtcagta tttataaaat 1620 

gtttattaag gctagaaaaa atgaatacaa tatcctgaag gtgaaatata ttctcttcaa 1680 

ttagcataaa tatgatttac ataagttagc tatacagcta ttgagatagt actttctagt 1740 

aaacttaaac tactttttaa acatacattt tgtgttgatt taacaaaaat atagagaatg 1800 

atttgcttta ttgtaattgt atataagtga ctggaaaagc acaaagaaat aaagtgggtt 1860

cgatctgttt accaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 1906 



Homo sapiens monocarboxylate transporter 2 (hMCT2) mRNA, complete



Sequence 2104 BP; 602 A; 400 C; 447 G; 654 T; 1 other;

ggaaacttct gcctcaggtg gggagaggag tccatagatc agggaaactt atgtcttggt 
gaaatggaag accatgtttc taaacacctg tcgccaggtt acttgaattt ccactagagg 
agcagaaatg ccaccaatgc caagtgcccc acctgtgcat ccacctccag atggaggatg 
gggttggatt gtggttggag cagcttttat ctccattgga ttttcctatg cattccccaa 
agctgtcacc gtattcttca aagaaattca gcaaatattc cacactacct acagtgaaat 
agcatggatt tcatccatta tgctggctgt tatgtacgca ggaggtcctg taagtagtgt 
tttggtgaat aaatacggca gccggccggt ggtgatagca ggaggcttat tatgctgtct 
tggaatggtg ttggcctcct ttagtagcag cgtggtacag ctgtacctca ctatgggatt 
cattacaggt ttaggtttag ccttcaacct gcaacccgcc ttaaccataa ttggcaaata 
cttctatagg aagcgaccca tggcaaatgg attggccatg gcaggaagtc ctgttttctt 
aagttcattg gctcctttca atcagtacct ttttaatact tttggctgga aaggaagctt 
cctgattttg ggaagtctac ttttgaatgc ctgtgtggct ggttccctca tgagacccct 
tggacccaat caaaccactt ctaagtctaa aaataagact ggcaaaacag aagatgattc 
aagcccaaag aaaatcaaaa cgaagaaatc aacttgggaa aaagttaata agtatttaga 
tttctccctt tttaagcata gaggatttct gatatatctg tctggaaatg tcattatgtt 
cctaggtttt tttgccccca ttatattctt ggctccatat gctaaagacc aaggaattga 
tgagtactcg gcagcttttc tgctatctgt tatggctttc gttgatatgt ttgctaggcc 
ttctgtagga ttaattgcaa actccaaata tattcgacct cgaattcagt acttcttcag 
ttttgcaatc atgttcaatg gagtgtgtca cctcttgtgc ccactggcac aggactacac 
aagcctggta ttatatgctg tattttttgg ccttggattt gggagtgtta gcagtgttct 
ctttgaaact ctcatggacc tcgtgggtgc accaagattt tccagtgccg tcggacttgt 
cacaattgtg gagtgtggcc cagttcttct tggccctcct cttgcaggta aattggtgga 
tttaactgga gaatataaat acatgtacat gtcctgtggg gctattgtgg tagcagcaag 
cgtgtggctg ctcattggca atgctatcaa ctatagattg cttgcaaagg aaaggaagga 
ggaaaatgca aggcagaagt ccagagaatc tgaacccttg agcaaatcta aacattcgga 
agatgttaac gtcaaagttt caaatgcaca gagtgtaacc tcagaaagag aaactaacat 
ttaacaagaa tcacatctct gatttcagtg tttatgactt tatctaggag tttgtttttc 
attttgtttt tttaaagtat tagaaaaggt tttagctgaa atgaggagtc acaattdaag 
gatggaggtg atattttcct caatggcaaa ttttaaatta gtttttaaaa acttacttat 
ttgggtagtt aaattttgag attatgcata gaaagaatcc atgctatagg tttatttcca 
tacctgactc tgggtgtggt ggttaaaata ctaattttaa agtcttccag tgactttcgg 
tcttggttat atggagaatt ctgaatccca aacccctggt ttaagtaggt agaaggagga 
tgcctaatcc tacaaagtga ccctttatac atttcatttt ttatttgata ttaaagtatg 
agatagagtt gagagacaat taattatccc ctcttacaca caaacacaca tactcccaca 
tacttaccca catgtacaca gagtatctgg agaataaaac ccaaattcaa aaaaaaaaaa 
aaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2104 
/translation="MPPMPSAPPVHPPPDGGWGW
EIQQIFHTTYSEIAWISSIMI^VMYAGGPVSSVL^ 

FSSSVVQLYLTMGFITGLGLAFNLQPALTIIGKYFYRKRPMANGLAMAGSPVFLSSLAP
FNQYLFNTFGWKGSFLILGSLLLNACVAGSLMRPLGPNQTTSKSKNKTGKTEDDSSPKK
IKTKKSTWEKVNKYLDFSLFKHRGFLIYLSGNVIMFLGFFAPIIFLAPYAKDQGIDEYS
AAFLLSVMAFVDMFARPSVGLIANSKYIRPRIQYFFSFAIMFNGVCHLLCPLAQDYTSL
VLYAVFFGLGFGSVSSVLFETLMDLVGAPRFSSAVGLVTIVECGPVLLGPPLAGKLVDL

TGEYKYMYMSCGAIVVAASVWLLIGNAIN^
DVNVKVSNAQSVTSERETNI"




Homo sapiens interferon-induced protein 44, mRNA (cDNA clone MGC:24007



/translation="MAVTTRLTRLHEKILQNHFGGKRLSLLYKGSVHGFRNGVLLDRCC
NQGPTLTVIYSEDHIIGAYAEESYQEGKYASIILFALQDTKISEWKLGLCTPETLFCCD
VTKYNSPTNFQIDGRNRKVIMDLKTMENLGLAQNCTISIQDYEVFRCEDSLDERKIKGV
IELRKSLLSALRTYEPYGSLVQQIRILLLGPIGAGKSSFFNSVRSVFQGHVTHQALVGT
NTTGISEKYRTYSIRDGKDGKYLPFILCDSLGLSEKEGGLCRDDIFYILNGNIRDRYQF
NPMESIKLNHHDYIDSPSLKDRIHCVAFVFDASSIQY^
HVALLTHVDSMDLITKGDLIEIERC

PVKDVLILSALRRMLWAADDFLEDLPFEQIGNLREEIINCAQGKK"



ggggcatttt gtgcctgcct agctatccag acagagcagc taccctcagc tctagctgat 
actacagaca gtacaacaga tcaagaagta tggcagtgac aactcgtttg acacggttgc 
acgaaaagat cctgcaaaat cattttggag ggaagcggct tagccttctc tataagggta 
gtgtccatgg attccgtaat ggagttttgc ttgacagatg ttgtaatcaa gggcctactc 
taacagtgat ttatagtgaa gatcatatta ttggagcata tgcggaagag agttaccagg
aaggaaagta tgcttccatc atcctttttg cacttcaaga tactaaaatt tcagaatgga 
aactaggact atgtacacca gaaacactgt tttgttgtga tgttacaaaa tataactccc 
caactaattt ccagatagat ggaagaaata gaaaagtgat tatggactta aagacaatgg 
aaaatcttgg acttgctcaa aattgtacta tctctattca ggattatgaa gtttttcgat 
gcgaagattc actggatgaa agaaagataa aaggggtcat tgagctcagg aagagcttac 
tgtctgcctt gagaacttat gaaccatatg gatccctggt tcaacaaata cgaattctgc 
tgctgggtcc aattggagct gggaagtcca gctttttcaa ctcagtgagg tctgttttcc 
aagggcatgt aacgcatcag gctttggtgg gcactaatac aactgggata tctgagaagt 
ataggacata ctctattaga gacgggaaag atggcaaata cctgccgttt attctgtgtg 
actcactggg gctgagtgag aaagaaggcg gcctgtgcag ggatgacata ttctatatct 
tgaacggtaa cattcgtgat agataccagt ttaatcccat ggaatcaatc aaattaaatc 
atcatgacta cattgattcc ccatcgctga aggacagaat tcattgtgtg gcatttgtat 
ttgatgccag ctctattcaa tacttctcct ctcagatgat agtaaagatc aaaagaattc 
gaagggagtt ggtaaacgct ggtgtggtac atgtggcttt gctcactcat gtggatagca 
tggatttgat tacaaaaggt gaccttatag aaatagagag atgtgagcct gtgaggtcca 
agctagagga agtccaaaga aaacttggat ttgctctttc tgacatctcg gtggttagca 
attattcctc tgagtgggag ctggaccctg taaaggatgt tctaattctt tctgctctga 
gacgaatgct atgggctgca gatgacttct tagaggattt gccttttgag caaataggga 
atctaaggga ggaaattatc aactgtgcac aaggaaaaaa atagatatgt gaaaggttca 
cgtaaatttc ctcacatcac agaagattaa aattcagaaa ggagaaaaca cagaccaaag 
agaagtatct aagaccaaag ggatgtgttt tattaatgtc taggatgaag aaatgcatag 
aacattgtag tacttgtaaa taactagaaa taacatgatt tagtcataat tgtgaaaaat 
agtaataatt tttcttggat ttatgttctg tatctgtgaa aaaataaatt tcttataaaa 
ctcggaaaaa aaaaaaaaaa aaaaaaaaaa aaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1714 



601067066F1 NIH_MGC_10 Homo sapiens cDNA clone IMAGE:3453257 5', mRNA
sequence.



aaatctcaag acacattcac aaacaaatgg ttatcaccaa ggtcttcatg ctctactcat 60 

gttgacatga gttgtattaa ttggtgactg gaagtccagg atctgttgag gaagtcagtg 120 

acccttaaat tcaggaacac tgccttggaa ggtggtggac ctttaaaaca gaagcttctc 180 

agttttgtag catctgatat gagagaatat gctagatatt cataaactta gggccaggca 240 

atgtggggcc cctggaatgc tactgggcac tctctaacct agtcctagaa atttcagttc 300 

caataatgtt ttcttcttct tttctagata gaaactatat gtatctcgtg gatctgccag 360 

taccagcctt ccaaatgaaa ctctttcaga gttagagaca cctggaaata ctcacttaca 420 

ccaccaaaca ctggggccac cacatcgata cctgcagcat ctttagtcaa gttggaggag 480 

aaagacaaca cttggtctaa gacacggcag caagacatcc ctgcatatgt tccagataaa 540

aatgaaagct gtcacaccca cttgcctccc caatctgtta aacagcttcg tgtctagtat 600 

gagctcagta ctttgcctgt gaaaatccca gaagcccccg ctgtcaatgg ttccccatcc 660 

aaccctgttt gctcctgtgt aacagtcaga tgatgactaa taataaaact gtactttttg 720 

gaaaaaaaca aaaggggggc ggcaaaagac cccgag 756



Human glutamate receptor subunit (GluH1) mRNA, complete cds.
glutamate receptor subunit. 



/translation="MQHIFAFFCTGFLGAVVGANFPNNIQIGGLFPNQQSQEHAAFRFA
LSQLTEPPKLLPQIDIVNISDSFEMTYRFCSQFSKGVYAIFGFYERRTVNMLTSFCGAL
HVCFITPSFPVDTSNQFVLQLRPELQDALISIIDHYKWQKFVYIYDADRGLSVLQKVLD
TAAEKIWQVTAVNILTTTEEGYRMLFQDIjEKKKE^ IKLEKN 
GIGYHYIIjANIjGFMDIDIjNKFKESGANVTGFQLVNYTDT^ 

DWKRPKYTSALTYIXSVKVMAET^QSLRRQRIDISRRGNAGDCIjANPAVPWGQGIDIQRA 
LQQVRFEGIiTGNVQFNEKGRRTNYTLHyiEMKHDGIRKIGY^ 

DNSSVQNRTYIVTTILEDPYVMLKKNANQFEGNDRYEGYCVELAAEIAKHVGYSYRLEI
VSDGKYGARDPDTKAWNGMVGELVYGRADVAVAPLTITLVREEVIDFSKPFMSLGISIM

IKKPQKSKPGVFSFLDPLAYEIWMCIVFAYIGVSVVLFLVSRFSPYEWHSEEFEEGRDQ
TTSDQSNEFGIFNSLWFSLGAFMQQGCDISPRSLSGRIVGGVWWFFTLIIISSYTANLA

AFLTVERMVSPIESAEDLAKQTEIAYGTLEAGSTKEFFRRSKIAVFEKMWTYMKSAEPS
VFVRTTEEGMIRVRKSKGKYAYLLESTMNEYIEQRKPCDTMKVGGNLDSKGYGIATPKG

SALRGPVNIAVLKLSEQGVLDKLKSKWW^ 

LIGGLGLAMLVALIEFCYKSRSESKRMKGFCLIPQQSINEAIRTSTLPKNSAGTAPSSG
GSGENGRVVSHDFPKSMQSIPCMSHSSGMPLGATGL"



Sequence 3178 BP; 840 A; 741 C; 837 G; 760 T; 0 other ; 
ggtaaaggga aaggggggga aacaccaaat ctatgattgg acctgggctt ctttttcgcc 
aatgcaaaaa ggaatatgca gcacattttt gccttcttct gcaccggttt cctaggcgcg 
gtagtaggtg ccaatttccc caacaatatc cagatcgggg gattatttcc aaaccagcag 
tcacaggaac atgctgcttt tagatttgct ttgtcgcaac tcacagagcc cccgaagctg 
ctcccccaga ttgatattgt gaacatcagc gacagctttg agatgaccta tagattctgt 
tcccagttct ccaaaggagt ctatgccatc tttgggtttt atgaacgtag gactgtcaac 
atgctgacct ccttttgtgg ggccctccac gtctgcttca ttacgccgag ctttcccgtt 
gatacatcca atcagtttgt ccttcagctg cgccctgaac tgcaggatgc cctcatcagc 
atcattgacc attacaagtg gcagaaattt gtctacattt atgatgccga ccggggctta 
tccgtcctgc agaaagtcct ggatacagct gctgagaaga actggcaggt gacagcagtc 
aacatcttga caaccacaga ggagggatac cggatgctct ttcaggacct ggagaagaaa 
aaggagcggc tggtggtggt ggactgtgaa tcagaacgcc tcaatgctat cttgggccag 
attataaagc tagagaagaa tggcatcggc taccactaca ttcttgcaaa tctgggcttc 
atggacattg acttaaacaa attcaaggag agtggcgcca atgtgacagg tttccagctg 
gtgaactaca cagacactat tccggccaag atcatgcagc agtggaagaa tagtgatgct 
cgagaccaca cacgggtgga ctggaagaga cccaagtaca cctctgcgct cacctacgat 
ggggtgaagg tgatggctga ggctttccag agcctgcgga ggcagagaat tgatatatct 
cgccggggga atgctgggga ttgtctggct aacccagctg ttccctgggg ccaagggatc 
gacatccaga gagctctgca gcaggtgcga tttgaaggtt taacaggaaa cgtgcagttt 
aatgagaaag gacgccggac caactacacg ctccacgtga ttgaaatgaa acatgacggc 
atccgaaaga ttggttactg gaatgaagat gataagtttg tccctgcagc caccgatgcc 
caagctgggg gcgataattc aagtgttcag aacagaacat acatcgtcac aacaatccta 1320 
gaagatcctt atgtgatgct caagaagaac gccaatcagt ttgagggcaa tgaccgttac 1380 
gagggctact gtgtagagct ggcggcagag attgccaagc acgtgggcta ctcctaccgt 1440 
ctggagattg tcagtgatgg aaaatacgga gcccgagacc ctgacacgaa ggcctggaat 
ggcatggtgg gagagctggt ctatggaaga gcagatgtgg ctgtggctcc cttaactatc 
actttggtcc gggaagaagt tatagatttc tccaaaccat ttatgagttt ggggatctcc 
atcatgatta aaaaaccaca gaaatccaag ccgggtgtct tctccttcct tgatcctttg 
gcttatgaga tttggatgtg cattgttttt gcctacattg gagtgagtgt tgtcctcttc 
ctggtcagcc gcttcagtcc ctatgaatgg cacagtgaag agtttgagga aggacgggac 
cagacaacca gtgaccagtc caatgagttt gggatattca acagtttgtg gttctccctg 1860
ggagccttca tgcagcaagg atgtgacatt tctcccaggt ccctgtctgg tcgcatcgtt 1920 
ggtggcgtct ggtggttctt caccttaatc atcatctcct catatacagc caatctggcc 1980 
gccttcctga ccgtggagag gatggtgtct cccattgaga gtgcagagga cctagcgaag 2040 
cagacagaaa ttgcctacgg gacgctggaa gcaggatcta ctaaggagtt cttcaggagg 2100 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



1440 
1500 
1560 
1620 
1680 
1740 
1800 



tctaaaattg ctgtgtttga gaagatgtgg acatacatga agtcagcaga gccatcagtt 2160 

tttgtgcgga ccacagagga ggggatgatt cgagtgagga aatccaaagg caaatatgcc 2220 

tacctcctgg agtccaccat gaatgagtac attgagcagc ggaaaccctg tgacaccatg 2280

aaggtgggag gtaacttgga ttccaaaggc tatggcattg caacacccaa ggggtctgcc 2340 

ctgagaggtc ccgtaaacct agcggttttg aaactcagtg agcaaggcgt cttagacaag 2400 

ctgaaaagca aatggtggta cgataaaggg gaatgtggaa gcaaggactc cggaagtaag 2460 

gacaagacaa gcgctctgag cctcagcaat gtggcaggcg tgttctacat cctgatcgga 2520 

ggacttggac tagccatgct ggttgcctta atcgagttct gctacaaatc ccgtagtgaa 2580 

tccaagcgga tgaagggttt ttgtttgatc ccacagcaat ccatcaacga agccatacgg 2640 

acatcgaccc tcccccgcaa cagcgcgggc acggcaccga gcagcggcgg cagtggagag 2700 

aatggtcggg tggtcagcca tgacttcccc aagtccatgc aatcgattcc ttgcatgagc 2760 

cacagttcag ggatgccctt gggagccacg ggattgtaac tggagcagat ggagacccct 2820 

tggggagcag gctcgggctc cccagcccca tcccaaaccc ttcagtgcca aaaacaacaa 2880 

caaaatgaaa cgcaaccacc accaaccact gcgaccacaa gaaggatgat tcaacaggtt 2940 

ttcctgaaga attgaaaaac cattttgctg tcccttttcc ttttttgatg ttctttcacc 3000 

cttttctgtt tgctaagtga ggatgaaaaa ataacactgt actgcaataa ggggagagta 3060 

accctgtcta atgaaacctg tgtctctgag agtagagtca ctggaacact aatgaggaaa 3120 

ctgcactgtt ttattttaat tcagttgtta gtgtgtctta gtgtgtgcaa tttttccc 3178 



zn32e02.s1 Stratagene endothelial cell 937223 Homo sapiens cDNA clone
IMAGE:549146 3', mRNA sequence.

cagtaataat cagaacaata tttattttta tatttaanat tcatagaaaa gtgccttaca 60 

tttaataaaa gtttgtttct caaagtgatc agaggaatta gatatagtct tgaacaccaa 120 

tattaatttg aggaaaatac accaaaatac attaagtaaa ttatttaaga tcatagagct 180 

tgtaagtgaa aagataaaat ttgacctcag aaactctgag cattaaaaat ccactattag 240 

caaataaatt actatggact tcttgcttta attttgtgat gaatatgggg tgtcactggt 300

aaaccaacac attctgaagg atacattact tagtgataga ttcttatgta ctttgctaga 360 

taacatggat atgagttgac aagtttctct ttcttcaatc ttttaagggg cagaggaaat 420 

gaggaagaaa agaaaaggaa ttacagcaat actggttcct tcctatagga aggattagat 480 

atgtttcctt tgccaaatat aaaaanaatt aataatggtt accaccagtg aacccnaggt 540 

attagggaaa taatggtcca gcatcncttg ccagaaaggg gtaagatggt tatgggtgaa 600 

c 601



Homo sapiens mRNA expressed in osteoblast, complete cds.

/translation="MVERCSRQGCTITMAYIDYNMIVAFMLGNYINLRESSTEPNDSLW

FSLQKKNDTTEIETLLLNTAPKIIDEQLVCRLSKTDIFIICRDNKIYLDKMITRNLKLR

FYGHRQYLECEVFRVEGIKDNLDDIKRIIKARE^

VGPVGSGKSSFFNSVKSIFHGHVTGQAVVGSDTTSITERYRIYSVKDGKNGKSLPFMLC

DTMGLDGAEGAGLCMDDIPHILKGCMPDRYQFNSRKPITPEHSTFITSPSLKDRIHCVA

YVLDINSIDNLYSKMLAKVKQVHKEVLNCGIA^

SQSRVMNVHKMLG^^

EETGAIERALQPCI"



gcacgaggaa gccacagatc tcttaagaac tttctgtctc caaaccgtgg ctgctcgata 60 

aatcagacag aacagttaat cctcaattta agcctgatct aacccctaga aacagatata 120 

gaacaatgga agtgacaaca agattgacat ggaatgatga aaatcatctg cgcaactgct 180 

tggaaatgtt tctttgagtc ttctctataa gtctagtgtt catggaggta gcattgaaga 240 

tatggttgaa agatgcagcc gtcagggatg tactataaca atggcttaca ttgattacaa 300

tatgattgta gcctttatgc ttggaaatta tattaattta cgtgaaagtt ctacagagcc 360 

aaatgattcc ctatggtttt cacttcaaaa gaaaaatgac accactgaaa tagaaacttt 420 

actcttaaat acagcaccaa aaattattga tgagcaactg gtgtgtcgtt tatcgaaaac 480 

ggatattttc attatatgtc gagataataa aatttatcta gataaaatga taacaagaaa 540 

cttgaaacta aggttttatg gccaccgtca gtatttggaa tgtgaagttt ttcgagttga 600 

aggaattaag gataacctag acgacataaa gaggataatt aaagccagag agcacagaaa 660 

taggcttcta gcagacatca gagactatag gccctatgca gacttggttt cagaaattcg 720 

tattcttttg gtgggtccag ttgggtctgg aaagtccagt tttttcaatt cagtcaagtc 780 

tatttttcat ggccatgtga ctggccaagc cgtagtgggg tctgatacca ccagcataac 840 

cgagcggtat aggatatatt ctgttaaaga tggaaaaaat ggaaaatctc tgccatttat 900 

gttgtgtgac actatggggc tagatggggc agaaggagca ggactgtgca tggatgacat 960 

tccccacatc ttaaaaggtt gtatgccaga cagatatcag tttaattccc gtaaaccaat 1020 

tacacctgag cattctactt ttatcacctc tccatctctg aaggacagga ttcactgtgt 1080 

ggcttatgtc ttagacatca actctattga caatctctac tctaaaatgt tggcaaaagt 1140 

gaagcaagtt cacaaagaag tattaaactg tggtatagca tatgtggcct tgcttactaa 1200 

agtggatgat tgcagtgagg ttcttcaaga caacttttta aacatgagta gatctatgac 1260 

ttctcaaagc cgggtcatga atgtccataa aatgctaggc attcctattt ccaatatttt 1320 

gatggttgga aattatgctt cagatttgga actggacccc atgaaggata ttctcatcct 1380 

ctctgcactg aggcagatgc tgcgggctgc agatgatttt ttagaagatt tgcctcttga 1440 

ggaaactggt gcaattgaga gagcgttaca gccctgcatt tgagataagt tgccttgatt 1500 

ctgacatttg gcccagcctg tactggtgtg ccgcaatgag agtcaatctc tattgacagc 1560 

ctgcttcaga ttttgctttt gttcgttttg ccttctgtcc ttggaacagt catatctcaa 1620 

gttcaaaggc caaaacctga gaagcggtgg gctaagatag gtcctactgc aaaccacccc 1680 

tccatatttc cgtaccattt acaattcagt ttctgtgaca tctttttaaa ccactggagg 1740 

aaaaatgaga tattctctaa tttattcttc tataacactc tatatagagc tatgtgagta 1800 

ctaatcacat tgaataatag ttataaaatt attgtataga catctgcttc ttaaacagat 1860 

tgtgagttct ttgagaaaca gcgtggattt tacttatctg tgtattcaca gagcttagca 1920 

cagtgcctgg taatgagcaa gcatacttgc cattactttt ccttcccact ctctccaaca 1980 

tcacattcac tttaaatttt tctgtatata gaaaggaaaa ctagcctggg caacatgatg 2040 

aaaccccatc tccactgc 2058



wy59c01.x1 Soares_NSF_F8_9W_OT_PA_P_S1 Homo sapiens cDNA clone
IMAGE:2552832 3', mRNA sequence.

ttttttcatt ctgcttttct ttattgtctg gctaacttac aaagatgcag atgtctaggg 60 

tagtctctac cctaccactt acactatcct gatgacacag atagcaaaat gtgtctgttt 120 

acatagtgca tgatatgaaa aaaaagtttt tcttcctcta cggtccttga ctataaggag 180 

ggaaaaatta atttcatgcc aacatttttg gggaacttta acaatcatcc catttctgct 240 

actaaaataa caaaactggt attacacttt aaaatataaa gacctaacag tttttacaaa 300 

tatgcaaata atctactact tagacataaa aaaaagttga tttcttttaa atcacaaagt 360 

aaggcaccat tggattaaac atttctcctg gcttttacta aataaaatgc atagtgaaat 420

aaatactgaa cactgagttt taatactgta atacatttca atataaaata agaggtgaat 480 

gtgaaaatac tgtattacat gttgaataca tttatctgaa aatgttataa aaaaacacac 540 

atgtaagctc tgatttc 557 



Homo sapiens mRNA for C11ORF25 gene



C11ORF25 gene.

/translation="MVHHSGSIQSFKQQKGMNISKSEITKETSLKPSRRSLPCLAQSYA
YSKSLSQSTSLFQSTESESQAPTSITLISTDKAEQVNTEENKNDSVLRCSFADLSDFCL
ALGKDKDYTDESEHATYDRSRLIl^ 

IDYILVYRKTNIPYDKRNTFEKNLRAEGLMLEKEPAIASPDIM 

IUjNIRMPFRKKCYYTDGRSKSMGRMQTYFRRIKDWMAQNPMVIjDKSAFPDLEESD 

PFSRARIHHFIINNKDTFFSNATRSRIVYHMLERTKYENGISKVGIRJCLINNGSYIAAF 

PPHEGAYKSSQPIKTHGPQNNRHLLYERWARWGMWYKHQPLDLIRLYFGEKIGLYFAWL 

GWYTGMLIPAAIVGLCVFFYGLFTMNNSQVSQEICKAT 

IYAKVTYLFDNGGTVFFAIFMAIWATVFLEFWKRRRS 

EAKYYKMEIVNPITGKPEPHQPSSDKVTRLLVSVSGIFFMISLVITAVFGVVVYRLVVM 
EQFAS FKWNF I KQYWQFATS AAAVCINF 1 1 IMLLNLAYEKIAYLLTNLEYPRTESEWEN 
S FAIiKMFLFQFVNIjNS S I FYXAFFLGRFVGHPGKYNKLFDRWRLEECHPSGCLIDLCLQ 
MGVIMFLKQ I WNNFMELGYPL IQNWWSRHKIKRG IHDAS I PQWENDWNIjQPMNLHGLMD 
EYLEMVLQFGFTTIFVAAFPIiAPLLALLNNI IE IRLDAYTCFVTQWRRPLPARATDIGIW 
LGILEGIGILAVITNAFVIAITSDYIPRFVYEYKYGPCANH^ 

FDLSELGMGKSGYCRYRDYRGPPWSSKPYEFTLQYWHILAARLAFIIVFEHLVFGIKSF
IAYLIPDVPKGLHDRIRREKYLVQEMMYEAELEHLQQQRRKSGQPVHHEWP"



ctttcaaata aggtgttgat ttatgtggtc ttgagaaatc tttgttgctc tgatcttaag 60 

tttcttttta taatattaga atcatagttt acctacatta cagggctttt atgaagatta 120 

aatcgagcat cgtgggtagt gttttttcag cccctgtgac taagtaggtt ataaatggtc 180 

ctcagtcatc atcatcataa atgattcaag gcctgaatgg cagaagaaga agaggaaaac 240 

tcaagtgaaa agtgctagac atagaacaca gcttacctaa taaaacatgc tactggctaa 300 

ctaagctggg gtggctttct gccccctcta cttaaattta agaaagaaaa acagaaacgt 360 

ctctgcatct gccactgcat accctctcag agcacagacc aactttgccc ttccttaaaa 420 

gtcccagggt ttacctacca acttttagga aataaataac cagcctcaaa gcttggcaaa 480 

gagggcatca gtatcagctc tagaggtttg ctccgattta tatctcctcc tccagtaaag 540 

tcttctgctc ccactcccag ccaaccccgc tcaacgccca cccctcactg cggcactgga 600 

cgcgcggggg atgcggggtt gttcccagag tagccgctaa ggggagccgg acgctagacc 660 

gccctcccgc gttcccatga caacgacacc ggcgggcgcg tagcctggag agcgaagtgc 720 

cggctacagc aggtgtcgga ttgcagtgcg ctcgctgagg ctccggacct tggagcgtct 780 

agagtctggc tactgttcct ccgcctccct ctcgggcagc tccctaagcc ggctgggacg 840 

cgcagagtga aaatggtcca ccattcaggc tccattcagt cctttaaaca gcaaaaaggt 900 

atgaatataa gcaagagtga gataacaaaa gaaacttcgt taaaaccgtc tcggagatcc 960 

ctgccttgcc tcgcccagag ctacgcttac tcaaagagct tgagccagtc tacttccctc 1020 

ttccagtcaa ccgagagtga atctcaggct cccacatcta taaccttaat ctccactgac 1080 

aaagcagagc aagttaatac tgaggagaat aaaaacgact ctgtgctgag atgttcattt 1140 

gctgacctca gcgatttttg tttggcccta ggaaaagata aggattacac ggatgaatca 1200 

gaacacgcta cttatgaccg atctcgtctc attaatgact ttgttatcaa agataaatct 1260 

gaattcaaga caaaattatc taagaatgac atgaattaca tagcatccag tggacctctg 1320 

ttcaaagatg gcaaaaggag aattgattac atcttggttt atagaaagac aaatatacca 1380 

tatgataaaa gaaacacatt tgaaaagaac ctcagagcag aaggcttgat gttggagaag 1440 

gagccagcta ttgcaagccc cgatatcatg tttattaaaa ttcacattcc atgggacacg 1500 

ctgtgcaagt atgcagagag gctgaatatc aggatgccct tcaggaaaaa atgctattac 1560 

actgacggga ggagcaaatc aatgggcagg atgcaaactt attttagaag aatcaaagac 1620 

tggatggccc aaaacccaat ggttcttgac aagtcagctt ttccagacct agaggagtca 1680 

gactgctata ctggcccctt cagccgtgca cggattcacc acttcataat aaataataaa 1740 

gacaccttct tcagcaatgc tactcgaagc agaatagtct atcacatgct ggaacgcacc 1800 

aaatatgaaa atggaatatc aaaagtgggt atccgtaaac ttataaacaa tggctcatac 1860 

atagcagcgt ttccaccaca tgagggagcc tacaaaagta gccagcccat taaaacccat 1920 

ggacctcaga ataacagaca tctattatat gagcgctggg cacgctgggg aatgtggtat 1980 

aagcatcagc ctctggattt aatcaggctg tactttggtg agaagattgg actatacttt 2040 



gcttggctgg gatggtatac tggaatgttg attcctgcag caattgttgg tttgtgcgtt 
ttcttctatg gattatttac aatgaataat agtcaagtaa gccaagaaat ttgtaaagcc 
actgaagtct ttatgtgccc tctctgtgac aagaactgct ccctgcagag actcaacgac 
agctgtatct atgccaaggt gacatatttg ttcgataatg gagggacagt cttctttgct 
atttttatgg caatatgggc cacagtcttc ctggagtttt ggaaaaggag aaggagtata 
ctgacctata cttgggacct tatcgaatgg gaagaagagg aggaaacact tcgtccccag 
tttgaagcca agtattacaa gatggagatt gtaaatccca tcacgggaaa acctgaacca 
catcagcctt cctcagacaa agtcactcgt cttcttgttt ctgtctcagg aatattcttc 
atgatatcct tggtgatcac tgcagtgttt ggagttgtgg tgtaccgcct ggttgtcatg 
gaacagtttg catcattcaa gtggaatttc atcaaacaat actggcagtt tgcaacatct 
gctgctgctg tctgtatcaa tttcataatc attatgttgc tgaatcttgc ttatgaaaaa
attgcttacc tcctcaccaa tttagaatat cctcgaacag aatcagagtg ggaaaacagc 
ttcgccctga agatgttcct cttccagttt gtcaatttaa acagttccat ^tctatatc 
gctttctttt tgggaagatt tgtaggccac ccaggaaaat acaataaact ttttgaccgg
tggagactgg aggaatgtca tcctagtggc tgtttgatag acctctgcct ccagatgggt 
gtcatcatgt ttttgaagca aatatggaac aacttcatgg aactaggata cccgttgatc 
cagaactggt ggtcacgaca taaaatcaag cggggaatac atgatgcttc catacctcag 
tgggaaaatg attggaatct gcagcccatg aaccttcatg gactgatgga tgagtactta 
gaaatggttt tgcaatttgg ttttaccacc atctttgttg cggcttttcc tctagcccct 
cttttggctt tgttaaacaa tatcattgaa atcaggctgg atgcatacaa atttgtcact
caatggcgga ggcctttgcc agcccgagca actgacatag gtatctggct tggaattctc 
gaaggaatcg gtatattggc tgtgatcacc aatgcatttg taattgctat tacttctgat 
tacatcccac gttttgttta tgaatacaaa tatggcccct gtgcaaatca tgtagaacca 
agtgaaaatt gcttgaaggg atatgtcaac aatagcctat ccttctttga cctgagtgag 
cttggtatgg gaaaatctgg ttattgcagg tacagagact acagaggccc gccttggagt 
tccaaaccct atgagtttac tttacaatac tggcatatcc ttgctgctag attggccttc 
attattgtgt ttgagcacct tgtttttggg attaagtcat tcatcgcata cctgattcca 
gacgtaccaa agggtctaca tgaccgaata cgacgagaga agtacttagt tcaagaaatg 
atgtatgagg ctgaactgga acatttgcaa caacaacgga gaaaaagtgg tcagcctgtt 
caccatgaat ggccttagtt gacacctgtt acccattagg ggtgataaca ttaatgggaa 
gaaatgatgg caactttgaa tgctaggtga aatctaggag gaaggcatac ttggcaaacc 
acatgtataa tatgctactt ggaaatgtat cacagccatc tctgggattt gaaatatcca 
gacttgtagg gaagaaaaca atgacttgac gaccttaaaa agggttagat tgacattgca
ggaagccagg atgtaattct cagaaccagt ctcagggaga tatatgcttg gagaactctg 
ccttccatca actgcagtgt aatgggaaag agggtggtga ggttttgaaa gacagatttc 
catgtaaatg tgcacaagcc aggcatggct taaaatatca tgcagtcttc actcacatac 4200
taatttgact ttggaaagta tggttgagtt cagatactta taagaaatga cttaaacaaa 4260
tccttgcctt ggctgccaga actctacatc ccttcagttt acaggttggt aggaattgcc 4320
catgtctctt gctgagatta tggatctgtg ccacagggga tttatgctgc tttacattga 4380
ctatgcagtt gaagagttgc acatcacctt tagtggtcaa accaaaaatc taagattcca 4440
aaaagaacag cattcattga aataatattc cattgcattg gctacttcaa gtctctttta 4500
cccctggcag agccagttgc agtgcaattt tttgtactaa atgccaataa agccacatac 4560
ttataaacta tgaatggcta ttcttatata gtttagaaca caactaatcc atactttatt 4620
gtggaaggct gttttatagt atatttaatt ttttctctat gatttattac aactctcacg 4680
gaatattaac ttgaaatgct gtaaatacgt tggttttcct tgtccctttg caccttgaca 4740
gtacgtagta tacattgtgg tttatgcagc tctgacacca gtttttgcta ttgtaatatg 4800
tttattcggg aaaactgagg ccagatgctc tgcaacactt acgctttttc aacaatgtgc 4860
actcttactc atgaactaat gaaaataatg catgaatata aatctatatt atttgctaaa 4920
aaaagaaaaa acatttgtgt tttagtttct tttttcatga ctgaagggag ttttatgtta 4980
ttatttcttc catagttttg ttttggttta tttttaacaa cgctctagaa acaaagtcaa 5040
attaatcaca aaagacataa ttttgctttg ttgtggaatt tctatttcaa cgtcaactct 5100
gtatctatga gtatgtctgt tccacagaca gatgaggcag gagtgatggg gcactcaaga 5160
aagttcagag gaggcataag ctatggaggt tggaaaggaa gaaagagaag agctgaagta 5220
aatgtatggt agaaattaag acgtttcttt gcaaaacaag gaaatcacac atatgcacac 5280
atacatattt atggttacta tgattgttat tatttgtcca atcaagcaaa gcaccattat 5340
ttctaacata taaaagacca gatcaaacac aatggaaata aagctactac atatatatgc 5400
cagtatttct ctagtaagat tatgtttctc ttactagaaa actattttct aaatattaac 5460



actgaaaatg ttttgttagc ttttccttct ttctctccag aagaaacatg yauayaLyat. 5520
agctgtttca ttgtttgttt ttgtcaagca tattcacttt cctccttgtc ctctgattcc 5580
gagcaaaggg cctcagactc tgaacttccc tcaagtgccg ttgttatgtg aactcttcca 5640
ttcagattcc agagaggttc tcatgctccc cccccctcct tatttgtagc ddtcytayca 5700
actaattcca ctaagtacaa gggagttttt tacactcctc catttttata gcatctgcat 5760
tttttttttt tgttaggtac atgtatacac ctgcctgagt ataaatactc uccccaccca 5820
ataataacat caaccaacat cttttccaaa ttagggccac agaacagcaa CattUy LCuy 5880
acagtagtat aaagaataat gatagctcta tccttaagaa gtatttcctt tcctttttat 5940
atagtcccgt tagggtttaa aaccatattg atcaactaga aagaaaaata uyaaaayaga 6000
aaaatatttt aatttaaaaa ttgtaataca ttgatttata aaatgccttc tctgatactt 6060
ttgaaacaga tgtgaaaaac agaaaaagaa aaaattgtct gaaatgttta ttttgcaaaa 6120
cagtgcaata gaatctagtt atgccttcat cactgttgac agtaaatact gacagcccct 6180
tgcagtgtgt tagttttaga tcactctgtt ttagttgaga gaaatgtttt atatcatggt 6240
ttttatatga atacaaatta tttctcaaag atttatagca cacactattc tcaggaattc 6300
tgtattacat gaatgctgct tatatatttt catattctaa cttgtctttt caagcaaata 6360
actaatatat atgtgcatgc agtctgcctt gacaagttgt tccaagctga agagctttca 6420
ctgtacaatg tgtggaaaat caccatagat catggctgaa atagtttgta attgtctgag 6480
tctgtgcacg tacttttaga taaaatgctg ctgagtgact gcatgatgag atacaacttc 6540
tgaatgctgc acattcttcc aaaatgatcc ttagcacaat ctattgtatg atggaatgaa 6600
tagaaaactt tttcactcaa taaattatta tttgatatgg t 6641



Homo sapiens isopentenyl-diphosphate delta isomerase, mRNA (cDNA clone
/translation="MMPEINTNHLDKQQVQLLAEMCILIDENDNKIGAETKKNCHLNEN
IEKGLLHRAFSVFLFNTENKLLLQQRSDAKITFPGCFTNTCCSHPLSNPAELEESDALG
VRRAAQRRLKAELGIPLEEVPPEEINYLTRIHYKAQSDGIWGEHEIDYILLVRKNVTLN
PDPNEIKSYCYVSKEELKELLKKAASGEIKITPWFKIIAATFLFKWWDNLNHLNQFVDH
EKIYRM"



Sequence 1911 BP; 651 A; 298 C; 375 G; 587 T; 0 other; 
gtgttctaga acagatcaga cattttgtaa tgatgcctga aataaacact aaccacctcg 60
acaagcaaca ggttcaactc ctggcagaga tgtgtatcct tattgatgaa aatgacaata 120
aaattggagc tgagaccaag aagaattgtc acctgaacga gaacattgag aaaggattat 180
tgcatcgagc ttttagtgtc ttcttattca acaccgaaaa taagcttctg ctacagcaaa 240
gatcagatgc taagattacc tttccaggtt gttttacgaa tacgtgttgt agtcatccat 300
taagcaatcc agccgagctt gaggaaagtg acgcccttgg agtgaggcga gcagcacaga 360
gacggctgaa agctgagcta ggaattccct tggaagaggt tcctccagaa gaaattaatt 420
atttaacacg aattcactac aaagctcagt ctgatggtat ctggggtgaa catgaaattg 480
attacatttt gttggtgagg aagaatgtaa ctttgaatcc agatcccaat gagattaaaa 540
gctattgtta tgtgtcaaag gaagaactaa aagaacttct gaaaaaagca gccagtggtg 600
aaattaagat aacgccatgg tttaaaatta ttgcagcgac ttttctcttt aaatggtggg 660
ataacttaaa tcatttgaat cagtttgttg accatgagaa aatatacaga atgtgaatat 720
gtaggtaaat gattacagaa aaatttatct gcttaacaaa cttagaatga ctttttcctt 780
ttaaatttag ttctatcatt aatttatcat taaatttagt tctatcattt ggtactatca 840
ttaatgtatt atatacactg atactttaaa acttgtgtgg aaaaaactaa cttataattt 900
tgtatcacac accctggata tgtgttctgt ttctaagcga catttgtgag agattattgt 960
aaaatgagag cgagcaaata aaacttaatt taatctttgc agatacatac ttatgggaaa 1020
tttgaacaaa tgagtgaaac tctgtgtttt tagtaggctg tgataaacat ttccggagca 1080
cttgcagagg acttgctatt tgccaggtgc tttatgtatc attaaatttt tctcatagtt 1140
cagaaaaatg tgcaaaggaa actattgtct cgctccttca aaacagtctt aattaacttt 1200
catattagca gattaaacta gcagagcagg ttcaagggaa attaaatgat atggacccta 1260
atttgtatca ttctgagttg attgtgtggt ttattcattc tggaaacatg ttgatactta 1320
cagtcagcca ctgcttttga taagtgatat tgattaggtt gaatcttctt gtaaatagta 1380
tttaccagtt agcaaagtct gtgttttcag aattacagtg agcacagagg tgttcataaa 1440
atgggaattg agtcccactc ggtaagagtt gcttaaactt gacactgttg acatttgggc 1500
tggataaaac ccctgtggtg gggtctgtgc tgtgcattgc aggatggtga gcagcgtccc 1560
tctcatgtga cacccacagt tatgccggat gttgccagat gcccctaggg gacagagtca 1620
acccccaact gaggaccact gtcctacaga gtcaggaaat attgtaggga gaaaaaaata 1680
acaacaacaa aggcctgtgt taatgttaaa tagatgagat tatggaatgt gtatattaat 1740
gttaaaaatt gtaccttgat caatgtactt tttataaact tgccatagat atctcagatt 1800
tgaaacctca agacagattt attattctta aatgctgtat gataatgaag aaaaataaaa 1860
atttatttct tgcaaagtta caaaaaaaaa aaaaaaaaaa aaaaaaaaaa a 1911



DE Human prostaglandin endoperoxide synthase mRNA, complete cds 
XX

KW prostaglandin endoperoxide synthase.
/translation="MSRSLLLRFLLFLLLLPPLPVLLADPGAPTPVNPCCYYPCQHQGI

CVRFGLDRYQCDCTRTGYSGPNCTIPGLWTWLIUJSLRPSPSFTHPLLTHGRWFWEFVNA 
TFIREMLMRLVLTVRSNIjIPSPPTyNSAHDYlSWESFSNVSYYTRILPSVPKDCPTPMG 
TKGKKQLPDAQLLARRFLIiRRKFI PDPQGTNLMFAFFAQHFTHQFFKTS GKMGPGFTKA 

LGHGVDLGHIYGDNLERQYQIJU J FKIK3KLKYQVI^GEMYPPSVEEAPVLMHYPRGIPPQ 
SQMAVGQEVFGLLPGIiMLYATLWIiREHNRVCDLLKAEHPTWGDBQLFQTTRLILIGETI 
KIVIEEWQQLSGYFLQLKFDPELLFGVQFQYIUnilAMEFNHLYHWHPLMPDSFKVGSQ 
EYSYEQFLFNTSMLVDYGVEALVDAFSRQIAGRIGGGRNMDHHILHVAVDVIRESREMR 
LQPFNEYRKRFGMKPYTSFQELVGEKEMAAELEELYGDIDAliEFYPfST.T.T.RvrTiDKreTj? 



gcgccatgag ccggagtctc ttgctccggt tcttgctgtt cctgctcctg ctcccgccgc 60
tccccgtcct gctcgcggac ccaggggcgc ccacgccagt gaatccctgt tgttactatc 120
catgccagca ccagggcatc tgtgtccgct tcggccttga ccgctaccag tgtgactgca 180
cccgcacggg ctattccggc cccaactgca ccatccctgg cctgtggacc tggctccgga 240
attcactgcg gcccagcccc tctttcaccc acttcctgct cactcacggg cgctggttct 300
gggagtttgt caatgccacc ttcatccgag agatgctcat gcgcctggta ctcacagtgc 360
gctccaacct tatccccagt ccccccacct acaactcagc acatgactac atcagctggg 420
agtctttctc caacgtgagc tattacactc gtattctgcc ctctgtgcct aaagattgcc 480
ccacacccat gggaaccaaa gggaagaagc agttgccaga tgcccagctc ctggcccgcc 540
gcttcctgct caggaggaag ttcatacctg acccccaagg caccaacctc atgtttgcct 600
tctttgcaca acacttcacc caccagttct tcaaaacttc tggcaagatg ggtcctggct 660
tcaccaaggc cttgggccat ggggtagacc tcggccacat ttatggagac aatctggagc 720
gtcagtatca actgcggctc tttaaggatg ggaaactcaa gtaccaggtg ctggatggag 780
aaatgtaccc gccctcggta gaagaggcgc ctgtgttgat gcactacccc cgaggcatcc 840
cgccccagag ccagatggct gtgggccagg aggtgtttgg gctgcttcct gggctcatgc 900
tgtatgccac gctctggcta cgtgagcaca accgtgtgtg tgacctgctg aaggctgagc 960
accccacctg gggcgatgag cagcttttcc agacgacccg cctcatcctc ataggggaga 1020
ccatcaagat tgtcatcgag gagtacgtgc agcagctgag tggctatttc ctgcagctga 1080
aatttgaccc agagctgctg ttcggtgtcc agttccaata ccgcaaccgc attgccatgg 1140

agttcaacca tctctaccac tggcaccccc tcatgcctga ctccttcaag gtgggctccc 1200 

ctacgagcag ttcttgttca acacctccat gttggtggac tatggggttg 1260 

aggccctggt ggatgccttc tctcgccaga ttgctggccg gatcggtggg ggcaggaaca 1320 

tggaccacca catcctgcat gtggctgtgg atgtcatcag ggagtctcgg gagatgcggc 1380 

tgcagccctt caatgagtac cgcaagaggt ttggcatgaa accctacacc tccttccagg 1440 

agctcgtagg agagaaggag atggcagcag agttggagga attgtatgga gacattgatg 1500 

cgttggagtt ctaccctgga ctgcttcttg aaaagtgcca tccaaactct atctttgggg 1560 

agagtatgat agagattggg gctccctttt ccctcaaggg tctcctaggg aatcccatct 1620 

gttctccgga gtactggaag ccgagcacat ttggcggcga ggtgggcttt aacattgtca 1680 

agacggccac actgaagaag ctggtctgcc tcaacaccaa gacctgtccc tacgtttcct 1740 

tccgtgtgcc ggatgccagt caggatgatg ggcctgctgt ggagcgacca tccacagagc 1800 

tctgaggggc aggaaagcag cattctggag gggagagctt tgtgcttgtc attccagagt 1860 

gctgaggcca gggctgatgg tcttaaatgc tcattttctg gtttggcatg gtgagtgttg 1920 

gggttgacat ttagaacttt aagtctcacc cattatctgg aatattgtga ttctgtttat 1980 

tcttccagaa tgctgaactc cttgttagcc cttcagattg ttaggagtgg ttctcatttg 2040 

gtctgccaga atactgggtt cttagttgac aacctagaat gtcagatttc tggttgattt 2100 

gtaacacagt cattctagga tgtggagcta ctgatgaaat ctgctagaaa gttagggggt 2160 

tcttattttg cattccagaa tcttgacttt ctgattggtg attcaaagtg ttgtgttccc 2220 

tggctgatga tccagaacag tggctcgtat cccaaatctg tcagcatctg gctgtctaga 2280 
atgtggattt gattcatttt cctgttcagt gagatatcat agagacggag atcctaaggt 2340 

ccaacaagaa tgcattccct gaatctgtgc ctgcactgag agggcaagga agtggggtgt 2400 
tcttcttggg acccccacta agaccctggt ctgaggatgt agagagaaca ggtgggctgt 2460 



attcacgcca ttggttggaa gctaccagag ctctatcccc atccaggtct tgactcatgg 2520 
cagctgtttc tcatgaagct aataaaattc gccc 2554 



DE 602381868P1 NIH_MGC_93 Homo sapiens cDNA clone IMAGE:4499393 5', mRNA
DE sequence.



tgtgaataga caagaagctg tactatatgt gctctctcag tggcaacaat gaagttttgc 60 

aattctagaa cttggatttt ttttttaaca aaagtcccaa aacaccaaaa atgtaaacaa 120 

gataagagat taatattgta gtgatgtaat ttaattaaag ttatattttg ggttaatttt 180 

aacaactgaa gtcttattgt tgaaacttat tttcaacaaa actgtgcagt taaatttgta 240 

tacgtattca catactgaaa gatgaaccgt taaaatagca cttaatttgt gtttcttcaa 300 

tatgtcttga tatactttgt gcaattaata ttacacatgt aagttgtatg gcagtttaca 360 

gaactcaatg acttgtcatg aggttttcat atgagctaca cattgtgtac attgatggtt 420 

ttttattttt acataaatcc attctgtcat tttcaacttt atatataaat ctccaatgtt 480 

atgggaaaca atagattgac acataatttt taaaaattat attgtaaaat ttctctatgg 540 

tgaataaagt cttttaatat aaaaaaaaaa aaaaaaaaaa gaaacaaaaa aagaaaaaaa 600 

aaaaaaaaaa aaaaaaaaaa aggggggggg ggaaaaaaaa accacggggg gcacaaatct 660 

atccgccacc cacgtttaga tcaaaggggc cccaagagag agacaaaaga aagcgacggc 720 

gacacaacaa ccgggggcac acgcgtacga ctagggagag cacaatcgcg gtagtaggac 780 

acacacaaaa aacgagaaca aacaggaccg tgacaccacc tgcgattgcc taataaaaag 840 

gcagaaacgg cacgcacagc gacgagcacg cagcagaaac accacacgca gcaccatgta 900 

c 901 



Homo sapiens mRNA for quinolinate phosphoribosyl transferase, complete cds. 

nicotinate mononucleotide pyrophosphorylase; QPRTase; 
quinolinate phosphoribosyl transferase. 

/translation="MDAEGLALLLPPVTLAALVDSWLREDCPGLNYAALVSGAGPSQAA
LWAKSPGVLAGQPFFDAIFTQLNCQVSWFLPEGSKLVPVARVAEVRGPAHCLLLGERVA
LNTLARCSGIASAAAAAVEAARGAGWTGHVAGTRKTTPGFRLVEKYGLLVGGAASHRYD
LGGLVMLKDNHVVPPGGVEKAVRAARQAADFALKVEVECSSLQEVVQAAEAGADLVLLD
NFKPEELHPTATALKAQFPSVAVEASGGITLDNLPQFCGPHIDVISMGMLTQAVPALDF
SLKLFAKEVAPVPKIH"

atggacgctg aaggcctggc gctgctgctg ccgcccgtca ccctggcagc cctggtggac 60
agctggctcc gagaggactg cccagggctc aactacgcag ccttggtcag cggggcaggc 120
ccctcgcagg cggcgctgtg ggccaaatcc cctggggtac tggcagggca gcctttcttc 180
gatgccatat ttacccaact caactgccaa gtctcctggt tcctccccga gggatcgaag 240
ctggtgccgg tggccagagt ggccgaggtc cggggccctg cccactgcct gctgctgggg 300
gaacgggtgg ccctcaacac gctggcccgc tgcagtggca ttgccagtgc tgccgccgct 360
gcagtggagg ccgccagggg ggccggctgg actgggcacg tggcaggcac gaggaagacc 420
acgccaggct tccggctggt ggagaagtat gggctcctgg tgggcggggc cgcctcgcac 480
cgctacgacc tgggagggct ggtgatgttg aaggataacc atgtggtgcc ccccggtggc 540
gtggagaagg cggtgcgggc ggccagacag gcggctgact tcgctctgaa ggtggaagtg 600
gaatgcagca gcctgcagga ggtcgtccag gcagctgagg ctggcgccga ccttgtcctg 660
ctggacaact tcaagccaga ggagctgcac cccacggcca ccgcgctgaa ggcccagttc 720
ccgagtgtgg ctgtggaagc cagtgggggc atcaccctgg acaacctccc ccagttctgc 780
gggccgcaca tagacgtcat ctccatgggg atgctgaccc aggcggtccc agcccttgat 840
ttctccctca agctgtttgc caaagaggtg gctccagtgc ccaaaatcca ctag 894



Homo sapiens mRNA for cytochrome P-450 HFLa, complete cds. 



CYP3A6; cytochrome P-450; human fetal liver cytochrome P-450. 

/translation="MDLIPNLAVETWLLLAVSLILLYLYGTRTHGLFKKLGIPGPTPLP
FLGNALSFRKGYWTFDMECYKKYRKVWGIYDCQQPMLAITDPDMIKTVLVKECYSVFTN
RRPFGPVGFMKNAISIAEDEEWKRIRSLLSPTFTSGKLKEMVPIIAQYGDVLVRNLRRE
AETGKPVTLKHVFGAYSMDVITSTSFGVSIDSLNNPQDPFVENTKKLLRFNPLDPFVLS
IKVFPFLTPILEALNITVFPRKVISFLTKSVKQIKEGRLKETQKHRVDFLQLMIDSQNS
KDSETHKALSDLELMAQSIIFIFAGYETTSSVLSFIIYELATHPDVQQKVQKEIDTVLP
NKAPPTYDTVLQLEYLDMVVNETLRLFPVAMRLERVCKKDVEINGMFIPKGVVVMIPSY
VLHHDPKYWTEPEKFLPERFSKKNKDNIDPYIYTPFGSGPRNCIGMRFALVNMKLALVR
VLQNFSFKPCKETQIPLKLRFGGLLLTEKPIVLKAESRDETVSGA"



gtgatggatc tcatcccaaa cttggccgtg gaaacctggc ttctcctggc tgtcagcctg 60 

atactcctct atctatatgg aacccgtaca catggacttt ttaagaagct tggaattcca 120 

gggcccacac ctctgccttt tttgggaaat gctttgtcct tccgtaaggg ctattggacg 180 

tttgacatgg aatgttataa aaagtataga aaagtctggg gtatttatga ctgtcaacag 240 

cctatgctgg ctatcacaga tcccgacatg atcaaaacag tgctagtgaa agaatgttat 300 

tctgtcttca caaaccggag gcctttcggg ccagtgggat ttatgaaaaa tgccatctct 360 

atagctgagg atgaagaatg gaagagaata cgatcattgc tgtctccaac attcaccagc 420 

ggaaaactca aggagatggt ccctatcatt gcccagtatg gagatgtgtt ggtgagaaat 480 

ctgaggcggg aagcagagac aggcaagcct gtcaccttga aacacgtctt tggggcctac 540 

agcatggatg tgatcactag cacatcattt ggagtgagca tcgactctct caacaatcca 600 

caagacccct ttgtggaaaa caccaagaag cttttaagat ttaatccatt agatccattc 660 

gttctctcaa taaaagtctt tccattcctt accccaattc ttgaagcatt aaatatcact 720 

gtgtttccaa gaaaagttat aagttttcta acaaaatctg taaaacagat aaaagaaggt 780 

cgcctcaaag agacacaaaa gcaccgagtg gatttccttc agctgatgat tgactctcag 840 

aattcaaaag actctgagac ccacaaagct ctgtctgatc tggagctcat ggcccaatca 900 

attatcttta tttttgctgg ctatgaaacc acgagcagtg ttctctcctt cattatatat 960 

gaactggcca ctcaccctga tgtccagcag aaagtgcaga aggaaattga tacagtttta 1020 

cccaataagg caccacccac ctatgatact gtgctacagt tggagtatct tgacatggtg 1080 

gtgaatgaaa cactcagatt attcccagtt gctatgagac ttgagagggt ctgcaaaaaa 1140 

gatgttgaaa tcaatgggat gtttattccc aaaggggtgg tggtgatgat tccaagctat 1200 

gttcttcatc atgacccaaa gtactggaca gagcctgaga agttcctccc tgaaaggttc 1260 

agtaaaaaga acaaggacaa catagatcct tacatataca caccctttgg aagtggaccc 1320 

agaaactgca ttggcatgag gtttgctctc gtgaacatga aacttgctct agtcagagtc 1380 

cttcagaact tctccttcaa accttgtaaa gaaacacaga tccccctgaa attacgcttt 1440 

ggaggacttc ttctaacaga aaaacccatt gttctaaagg ctgagtcaag ggatgagacc 1500 

gtaagtggag cctgatttcc ctaaggactt ctggtttgct ctttaagaaa gctgtgcccc 1560 

agaacaccag agacctcaaa ttactttaca aatagaaccc tgaaatgaag acgggcttca 1620 

tccaatgtgc tgcataaata atcagggatt ctgtacgtgc attgtgctct ctcatggtct 1680 

gtatagagtg ttatacttgg taatatagag gagatgacca aatcagtgct ggggaagtag 1740 

atttggcttc tctgcttctc ataggactat ctccaccacc cccagttagc accattaact 1800 

cctcctgagc tctgataaca taattaacat ttctcaataa tttcaaccac aatcattaat 1860 

aaaaatagga attattttga tggctctaac agtgacattt atatcatgtg ttatatctgt 1920 

agtattctat agtaagcttt atattaagca aatcaataaa aacctcttta c 1971 



Human mRNA for endothelin converting enzyme, complete cds. 

endothelin converting enzyme. 

/translation="MRGVWPPPVSALLSALGMSTYKRATLDEEDLVDSLSEGDAYPNGL
QWFHSPRSGQRCWAARTQVEKRLVVLWLI^ 

VSVTS S IIiS SMDPTVDPCHDFFS YACGGWI KANP VPDGHSRWGTFSNIiWEHNQAI IKHL 
LENSTASVSEAERKAQV^rYRAOOTETRIEEIjRAI^ 

TLQVVTAHYRTS PFFS VYVS ADSKNSNSNVI QVI3QSGLGLP SRDYYTiNKTENEKVIjTG Y 

IdmiVQLGKLLGGGDEEAJlRPQMQQIIiDFETAI^ 

TLAPAINWLPFLNTIFYPVEINESEPIVVY^ 

RKTS S FLDQRFQDADEKFMEVMYGTKKTCLPRWKFCVSDTENNLGFALGPMFVKATFAE 

DSKSIATEIILEIKKAFEESIaSTIiKW^EETRKSAKEKADAIYNMIGV^ 

KVFNDYTA VTDLYFENAMRFFNFS WRVTADQLRKAPNRDQWSMT^ PTKNE I 

VFPAGI LQAPFYTRS S PKAIiNFGGIGWVGHELTHAFDDQGRE YDKDGNLRPWVTKNS S V 

EAFKRQTEOWEQYSI^SVNGEPVNGRHTLG 

IiPTLGLTNNQLFFLGFAQVWCSVRTPES SHEGLITDPHS PSRFRVIGSIiSNSKEFSEHF 
RCPPGSPMNPPHKCEVW ■ 



gcggggcg tgtggccgcc cccggtgtcc gccctgctgt cggcgctggg gatgtcgacg 60 

tacaagcggg ccacgctgga cgaggaggac ctggtggact cgctctccga gggcgacgca 120 

taccccaacg gcctgcaggt gaacttccac agcccccgga gtggccagag gtgctgggct 180 

gcacggaccc aggtggagaa gcggctggtg gtgttggtgg tacttctggc ggcaggactg 240 

gtggcctgct tggcagcact gggcatccag taccagacaa gatccccctc tgtgtgcctg 300 

agcgaagctt gtgtctcagt gaccagctcc atcttgagct ccatggaccc cacagtggac 360 

ccctgccatg acttcttcag ctacgcctgt gggggctgga tcaaggccaa cccagtccct 420 

gatggccact cacgctgggg gaccttcagc aacctctggg aacacaacca agcaatcatc 480 

aagcacctcc tcgaaaactc cacggccagc gtgagcgagg cagagagaaa ggcgcaagta 540 

tactaccgtg cgtgcatgaa cgagaccagg atcgaggagc tcagggccaa acctctaatg 600 

gagttgattg agaggctcgg gggctggaac atcacaggtc cctgggccaa ggacaacttc 660 

caggacaccc tgcaggtggt caccgcccac taccgcacct cacccttctt ctctgtctat 720 

gtcagtgccg attccaagaa ctccaacagc aacgtgatcc aggtggacca gtctggcctg 780 

ggcttgccct cgagagacta ttacctgaac aaaactgaaa acgagaaggt gctgaccgga 840 

tatctgaact acatggtcca gctggggaag ctgctgggcg gcggggacga ggaggccatc 900 

cggccccaga tgcagcagat cttggacttt gagacggcac tggccaacat caccatccca 960 

caggagaagc gccgtgatga ggagctcatc taccacaaag tgacggcagc cgagctgcag 1020 

accttggcac ccgccatcaa ctggttgcct tttctcaaca ccatcttcta ccccgtggag 1080 

atcaatgaat ccgagcctat tgtggtctat gacaaggaat accttgagca gatctccact 1140 

ctcatcaaca ccaccgacag atgcctgctc aacaactaca tgatctggaa cctggtgcgg 1200 

aaaacaagct ccttccttga ccagcgcttt caggacgccg atgagaagtt catggaagtc 1260 

atgtacggga ccaagaagac ctgtcttcct cgctggaagt tttgcgtgag tgacacagaa 1320 

aacaacctgg gctttgcgtt gggccccatg tttgtcaaag caaccttcgc cgaggacagc 1380 

aagagcatag ccaccgagat catcctggag attaagaagg catttgagga aagcctgagc 1440 

accctgaagt ggatggatga ggaaacccga aaatcagcca aggaaaaggc cgatgccatc 1500 

tacaacatga taggataccc caacttcatc atggatccca aggagctgga caaagtgttt 1560 

aatgactaca ctgcagttcc agacctctac tttgaaaatg ccatgcggtt tttcaacttc 1620 

tcatggaggg tcactgccga tcagctcagg aaagccccca acagagatca gtggagcatg 1680 

accccgccca tggtgaacgc ctactactcg cccaccaaga atgagattgt gtttccggcc 1740 

gggatcctgc aggcaccatt ctacacacgc tcctcaccca aggccttaaa ctttggtggc 1800 

ataggtgtcg tcgtgggcca tgagctgact catgcttttg atgatcaagg acgggagtat 1860 

gacaaggacg ggaacctccg gccatggtgg aagaactcat ccgtggaggc cttcaagcgt 1920 

cagaccgagt gcatggtaga gcagtacagc aactacagcg tgaacgggga gccggtgaac 1980 

gggcggcaca ccctggggga gaacatcgcc gacaacgggg gtctcaaggc ggcctatcgg 2040 

gcttaccaga actgggtgaa gaagaacggg gctgagcact cgctccccac cctgggcctc 2100 

accaataacc agctcttctt cctgggcttt gcacaggtct ggtgctccgt ccgcacacct 2160 

gagagctccc acgaaggcct catcaccgat ccccacagcc cctctcgctt ccgggtcatc 2220 



ggctccctct ccaattccaa ggagttctca gaacacttcc gctgcccacc tggctcaccc 2280 

atgaacccgc ctcacaagtg cgaagtctgg taaggacgaa gcggagagag ccaagacgga 2340 

ggaggggaag gggctgagga cgagaccccc atccagcctc cagggcattg ctcagcccgc 2400 
ttggccacc 2409 



602386668F1 NIH_MGC_93 Homo sapiens cDNA clone IMAGE:4515521 5', mRNA 
sequence. 



EST. 



gcagaatgga agcttagagg aacttgcctg tgagcgctgg tcttgtgttg gtttgtgatg 60 

taacgatctt gctggggttt tttgcttgtt ttgagggaaa tgtcttggag taaattttaa 120 

gttcctggag ttaatttgtt ttacaggaat ttgtttttta aaaaaatagg atcattctga 180 

acttggaatg acccccttat atattttctg aaaatgaaaa cagttacatg aaaaaaattt 240 

ccaatgaaga tgtcagcatt ttatgaaaaa ccagaagtta ttagatgaaa gcagcgagtg 300 

aatctttaaa acagacttga tcacgcacac acaataagtc tttctctccg aaaccggaag 360 

taaatctata tctgttagaa ataatgtagc caaaagaatg taaatttgag gattttttgc 420 

caatagttta tagaaaatat atgaaccaaa gtgatttgag tttgtaaaaa tgtaaaatag 480 

tatgaacaaa atttgcactc taccagattt gaacatctag tgaggttcac attcatacta 540
agttttcaac attgtgttct tttggcattc attttttact tttattaaag gttcaaaacc 600
aaaaaagaaa aaaag 615



Homo sapiens mRNA for Rev-ErbAalpha protein (hRev gene) 

hRev gene; Rev-ErbAalpha; thyroid hormone receptor. 



/translation="MTTLDSNNNTGGVITYIGSSGSSPSRTSPESLYSDNSNGSFQSLT
QGCPTYFPPSPTGSLTQDPARSFGSIPPSLSDDGSPSSSSSSSSSSSSFYNGSPPGSLQ
VAMEDSSRVSPSKSTSNITKLNGMVLLCKVCGDVASGFHYGVLACEGCKGFFRRSIQQN
IQYKRCLKNENCSIVRINRNRCQQCRFKKCLSVGMSRDAVRFGRIPKREKQRMLAEMQS
AMNLANNQLSSQCPLETSPTQHPTPGPMGPSPPPAPVPSPLVGFSQFPQQLTPPRSPSP
EPTVEDVISQVARAHREIFTYAHDKLGSSPGNFNANHASGSPPATTPHRWENQGCPPAP
NDNNTLAAQRHNEALNGLRQAPSSYPPTWPPGPAHHSCHQSNSNGHRLCPTHVYAAPEG
KAPANSPRQGNSKNVLLACPMNMYPHGRSGRTVQEIWEDFSMSFTPAVREVVEFAKHIP
GFRDLSQHDQVTLLKAGTFEVLMVRFASLFNVKDQTVMFLSRTTYSLQELGAMGMGDLL
SAMFDFSEKLNSLALTEEELGLFTAVVLVSADRSGMENSASVEQLQETLLRALRALVLK
NRPLETSRFTKLLLKLPDLRTLNNMHSEKLLSFRVDAQ"



ccgttgcctc aacgtccaac ccttctgcag ggctgcagtc cggccacccc aagaccttgc 60 

tgcagggtgc ttcggatcct gatcgtgagt cgcggggtcc actccccgcc cttagccagt 120 

gcccaggggg caacagcggc gatcgcaacc tctagtttga gtcaaggtcc agtttgaatg 180 

accgctctca gctggtgaag acatgaccac cctggactcc aacaacaaca caggtggcgt 240 

catcacctac attggctcca gtggctcctc cccaagccgc accagccctg aatccctcta 300 

tagtgacaac tccaatggca gcttccagtc cctgacccaa ggctgtccca cctacttccc 360 

accatccccc actggctccc tcacccaaga cccggctcgc tcctttggga gcattccacc 420 

cagcctgagt gatgacggct ccccttcttc ctcatcttcc tcgtcgtcat cctcctcctc 480 

cttctataat gggagccccc ctgggagtct acaagtggcc atggaggaca gcagccgagt 540 

gtcccccagc aagagcacca gcaacatcac caagctgaat ggcatggtgt tactgtgtaa 600 

agtgtgtggg gacgttgcct cgggcttcca ctacggtgtg ctcgcctgcg agggctgcaa 660 

gggctttttc cgtcggagca tccagcagaa catccagtac aaaaggtgtc tgaagaatga 720 

gaattgctcc atcgtccgca tcaatcgcaa ccgctgccag caatgtcgct tcaagaagtg 780 

tctctctgtg ggcatgtctc gagacgctgt gcgttttggg cgcatcccca aacgagagaa 840 

gcagcggatg cttgctgaga tgcagagtgc catgaacctg gccaacaacc agttgagcag 900 

ccagtgcccg ctggagactt cacccaccca gcaccccacc ccaggcccca tgggcccctc 960 

gccaccccct gctccggtcc cctcacccct ggtgggcttc tcccagtttc cacaacagct 1020 

gacgcctccc agatccccaa gccctgagcc cacagtggag gatgtgatat cccaggtggc 1080 

ccgggcccat cgagagatct tcacctacgc ccatgacaag ctgggcagct cacctggcaa 1140 

cttcaatgcc aaccatgcat caggtagccc tccagccacc accccacatc gctgggaaaa 1200 

tcagggctgc ccacctgccc ccaatgacaa caacaccttg gctgcccagc gtcataacga 1260 

ggccctaaat ggtctgcgcc aggctccctc ctcctaccct cccacctggc ctcctggccc 1320 

tgcacaccac agctgccacc agtccaacag caacgggcac cgtctatgcc ccacccacgt 1380 

gtatgcagcc ccagaaggca aggcacctgc caacagtccc cggcagggca actcaaagaa 1440 

tgttctgctg gcatgtccta tgaacatgta cccgcatgga cgcagtgggc gaacggtgca 1500 

ggagatctgg gaggatttct ccatgagctt cacgcccgct gtgcgggagg tggtagagtt 1560 

tgccaaacac atcccgggct tccgtgacct ttctcagcat gaccaagtca ccctgcttaa 1620 

ggctggcacc tttgaggtgc tgatggtgcg ctttgcttcg ttgttcaacg tgaaggacca 1680 

gacagtgatg ttcctaagcc ggaccaccta cagcctgcag gagcttggtg ccatgggcat 1740 

gggagacctg ctcagtgcca tgttcgactt cagcgagaag ctcaactccc tggcgcttac 1800 

cgaggaggag ctgggcctct tcaccgcggt ggtgcttgtc tctgcagacc gctcgggcat 1860 

ggagaattcc gcttcggtgg agcagctcca ggagacgctg ctgcgggctc ttcgggctct 1920 

ggtgctgaag aaccggccct tggagacttc ccgcttcacc aagctgctgc tcaagctgcc 1980 

ggacctgcgg accctgaaca acatgcattc cgagaagctg ctgtccttcc gggtggacgc 2040 

ccagtgaccc gcccggccgg ccttctgccg ctgccccctt gtacagaatc gaactctgca 2100 

cttctctctc ctttacgaga cgaaaaggaa aagcaaacca gaatcttatt tatattgtta 2160 

taaaatattc caagatgagc ctctggcccc ctgagccttc ttgtaaatac ctgcctccct 2220 

cccccatcac cgaacttccc ctcctcccct atttaaacca ctctgtctcc cccacaaccc 2280 

tcccctggcc ctctgatttg ttctgttcct gtctcaaatc caatagttca cagctaaaaa 2340 

aaaaaaaaaa aaaag 2355 



Homo sapiens insulin induced protein 1 (INSIG1) gene, complete cds. 

/translation="MPRLHDHFWSCSCAHSARRRGPPRASTAGLPPKVGEMINVSVSGP

SLLAAHGAPDADPAPRGRSAAMSGPEPGSPYPOTWHHRLL 

LQIQIWVTI-FPEEVIATIFSSAVmVPPCCGTAAAWGIiLYPCIDSHIiGEPHCT 

VMRCIAGFGGINHASAKLDFANNVQIjSIjTIiAAL^ 

IjITQFLVYNGWQYTSPDFLYIRSWLPCIFFSGGVTVGNIGRQLAMGVPEKPHSD" 

exon 4978..5102
/number=2
/gene="INSIG1"
exon 5665..5831
/number=3
/gene="INSIG1"
exon 6136..6235
/number=4
/gene="INSIG1"
exon 10635..>12003
/number=5
/gene="INSIG1"



Sequence 12003 BP; 2935 A; 2843 C; 2868 G; 3356 T; 1 other; 
agctcacagt ggggtcgagg agacacagca caggcttctg aaaggtgccc tgtccattag 60
ggagcacccc atttacagat gggtcatcct tcagctcact agtttcccag cgcatgacag 120
tgggtagctt ttttttttcc tattataaac aaatcttcga tgtgactgcg atgatctacg 180
gatgatctac gtatctggtt gtgtgatcct caagcactta tgcaaggtgg agtgtaaaca 240
atggccaaga cccaaactgc aatgttccct gaagtccaac ctgcttggag tccttgggaa 300
cgtgtacaat ggttaacagg cttgagagga gtcagctggg agagtgagag cgagactgcc 360
ctgcagaggg gccgctgagg atggacccct gtaggccgtc cacacccacc acgcgtgtca 420
tcctcaggaa atgcaaccgg atgctcggaa tttcgcccct aaagcgactc agggcagtct 480
cagctcagga agacttcttt ttttgacctt ctggcttaaa gccacaaccc caggctgagg 540
atgccacatc aacggcgttt cctcgccacg ctgcaggcgt gactcacgtt tctaagcctg 600
cagggtcccc gggccaagcg cggctactga gagcacagcg tcctccctca gcccctgctc 660
atctgcacgg agcaaagccg cagctttctc tgctgtcagc gctttatacg actcggatgt 720
aggctctgag cgggcacggg cccgccctca gacggcgacc cgaggggcaa acaggatgga 780
ggcgcgctcc cttggaggct gtggtctcga ttttatctgc tcctgggtgg tgcgggtggt 840
ttgctctcta ggcgtgagca aaggccacaa ggggcccgaa aaaatcaccc aaggatgcaa 900
gcccgccatt gcgctgagca ggggcctccc cgccgccaga gcccgcgccc ctccagggag 960
gcccgcgcga tcgagacccg ccgcctcccg accaggcgcg ggacaccagg cgaggggaga 1020
caggagccgg gaggggaggg gccgggcccg caaccccacc cagtcctgcc tccgcctcct 1080
cccggtagga ccgtcccctc ccaggcccca gctccgcggc cgcctcacca gaccgcgtgc 1140
ggacgggctc gcgggcgggg cggggcaagc tcaggccacg cccctgggcg gtgcgcgcgg 1200
gcggtgggta ggccgggcag gctcacgtga tgcgggcccc gcggcccgcg gatataaacc 1260
cgcgcgcccg ccaggcgctg cggccgtccc gggccgtgac tcctcctttc ccccgccccg 1320
cctccgttcg gagagccggc gggcgggcgc ctctcggcca ggtacgcggc cggctgggat 1380
aggggtcgcg ggcgggcttt ggtcgcgcag cagccggtcc tccccggagg taggcgggcg 1440
cgggccctgt tgggtctttg ggacgcgggt cccgctgggg ccggggatgc tcctctctgt 1500
gagcgcggtc ccgtcccccc ctgtccccgg gcgggcgcac gggggtctaa ccttgggggg 1560
cggcgctccc gcctgtcccg ggtgccgggg ttcgcgtccc gcggggcctt cctcgctctt 1620
tgtctcttct gagtgaactt gatgaccccc ttcttccagg aagcgcctct tggacgcgtg 1680
tgaccgatgc ccagattgca cgaccacttc tggagctgct cctgtgcgca cagcgcgagg 1740
cgccgaggcc ccccgcgagc cagcaccgcg gggctgccgc ccaaggttgg ggagatgatc 1800
aacgtttccg tgtccgggcc ctccctgctg gcggcccacg gtgccccgga cgctgacccc 1860
gcgcccaggg gccgcagtgc tgcgatgagc ggccccgagc ccggcagccc ctaccccaac 1920
acctggcatc atcgcctgtt gcagaggagc ctcgtgctct tctcggttgg ggtggtccta 1980
accctggtgc tcaacctgct gcagatccag aggaatgtca ctctcttccc cgaggaggtg 2040
atcgccacca tcttttcctc cgcctggtgg gtccctccct gctgcgggac agcagctggt 2100
gagtacccct cctggttctt ctggaagtaa aaagggtgtc ttttcggttg aaagtacttt 2160



tcagcctata agaaacatgc atttgaaact ggaactatcc acagattgaa agtgtgcatc 2220 

tccagtcttg ggatttagag aaatgaagga ggggtagaga ggacctggca cccccgaggg 2280 

cgggttcggg acacagctct ctgaagtttg aggaagcaca gtgatggaga tgaggcaggg 2340 

agataattgg ctgttcaagc aggtgctgtg ccaagttgac tgagggccct tccctcctgg 2400 

cgtccgggta cagccccagc tgtcatcccg cgccagccgc cccggattgg ccgcctgggg 2460 

aactggtgag cagcgggact gaaaggccat tggtcgcggc agcaaacatg ctaccctgct 2520 

ggataactga aaacaaaccc gtcacatggg ttcaggaggc ggttgccagg ggagcaactt 2580 

ctcagcactt ccttttgccg ggtcgctgag taaactgtgg tctgaagcca aaccgcctac 2640 

accctgaact gttagagctc tgtagattac gttcttaaga aaagataccc cttacggcgt 2700 

ttagattttc gtacacccaa gcagtagcat agatgctgta caacgtaaca ctgagtcagt 2760 

ctgaactgtg gacacgggga ctgctgggtg tgtgtactag actttcttgg ctatcattga 2820 

cccgctttta tagaaatgtt gagttgatct tggctcagca gatcacttac tgctcctgta 2880 

gtcacacagc atttgtttat ttaacgagct tgtcgggtgc tttctgcaac ctcaactcta 2940 

taagctgttg aaatgttaat ccaaaagtga gcaccacaag cccctatcct taagcagctt 3000 

agagtctggt ggggaaaaca aaccagtatg tcagtgactc cagtgcaggg cagggagcat 3060 

cctgtttgct ttgtggctga cgcctgccag ctgactgctg accataggct tcagatgctt 3120 

tgctgtgttc attccttaca gcaatgccat ttgtagcttc atttatagat gtggcaactg 3180 

aatcacagtt gaagtcaagc ccaaggttgg cgcaggtaga aagggcccaa gccatcggtg 3240 

gtagactgct ccttcttgcc atggtagggt gtgtctcctg cactgtggaa acacattgcc 3300 

tcagttgact cactgaaaac agcaagcagc agagaggttt gtagcagatt aaacaggggg 3360 

ctccgtgttg aagaactagg acaggtgtct gtaactaaaa agtagtgggt agaatgtttt 3420 

catatgtgtt ctcggaggga cgggttcctc tgtcgtttgt cgaaacacat tatgtggctg 3480 

ttctaagtga acactttttg agaacgatat ctagagttga aattccatgt ttcatataca 3540 

gttggcagaa tagtttcttt cctttaaccc taaaacttga tcttagaaat gagataggta 3600 

gccttcagta gttgtaattg gatcctcaag ggtttctagg aatatctgga caatttctca 3660 

tactctcata ttctactttt ttttaaacca tcagccaaga gcataagtaa aatttttcat 3720 

agtatgtttt ttacctattt atttattaag ctatagtgag gtaaagttaa tttggggggt 3780 

ccatttttca gaactccaaa attagaacct gatgaagtat aataaattga aaacaacatt 3840 

ttttatctga atgtttgatg caaactgtgt ttcttgtttc tcagttgaac ctaaaaggta 3900 

tatttctttc atactagact gaggtgaaac aattgcaggg ttctcaagag ccctgggaca 3960 

cagtgaccct tgtgtcccag catccaaccc cacggggcgg caccccactc ccccctacag 4020 

acattcattc ctgccttcat ctttggcatt ttacttctct tgggggaatg ccattccttc 4080 

ccacctgtcc aaattccaag ttcgtgtcta taaagaataa ctattccaga ccacagtgat 4140 

ctctttctct gtgtcccagt tacataatcc ttataatttt acacatgaga aaacaaacat 4200 

gcagagccac gtcccatcag agggcaagct gggatttgag gctgagtttt ctcttcttgg 4260 

tctggtgttc tttccgcagc ttgaggcttg accatgcggg aaccagatgg agaataagcg 4320 

aagcctgtgt ggggctgtct cgccccgtac ttgcccatgt gcatgtgtgc acagtaggtg 4380 

ctcagtgcct cgtacagaac atgctagaac ccccgaaaat caactccagg gtcactctag 4440 

gagcatgtga cccagactgt caaagtcaca tcagcaaata tgtatctcat cacgtgcccc 4500 

aggcccagta ctgggtggtg gggaggagtg tcaagagctg tgatgtgaca gggggtggac 4560 

agatgctaac ctttttattg taagatagaa atcacaccgg atgaaaagac aaggtggaac 4620 

tacatagtca tttaagttct gtgtgcttta ggcaaatgtg gatattaaat ataaggttac 4680 

aatattgctc atattaatat gaataatccc tgcttgattc cttcctgtgt ttactgagac 4740 

atgttcaact acactaaata atagtgattt ttaaaaaact tttcccaaac agtacaattt 4800 

tgtgggagtg gtactcaaag agaacaagaa tataaaatca ctctgaagtt atagctgata 4860 

attttttcta ggaaaaggac tggtttgagt aatgtactgt gttactaatg accacgttgg 4920 

aaattctgga acttggaatc tgctgtattc taacattggc aacttttttt aaaacagctg 4980 

ttgttggcct actgtacccc tgtatcgaca gtcacctcgg agaaccccac aaatttaaga 5040 

gagaatgggc cagtgtcatg cgctgcatag caggttttgg tggcattaac cacgccagtg 5100 

ctgtatcctt aattttctgt gctacgtcca gagtatcttc ttaggttata tattgaagca 5160 

ttttgttttg ttatatactc aaatgactcc atgtcataat acagacatga tggtggtacc 5220 

tacccatttc agattactaa aaagagaaag aagaatgggg ctatcgatga cttcatagca 5280 

aaatatattg ttttatggtc aaaataaata taaaattatt taaattttta tatatattaa 5340 
aaaaattttt aaaaatatat ataaaatata taataaatat atatttatat aaatatatat 5400 

aagcactcca tatatagctt tcagcaagtt cattctcatg gctaagtttc tgactcagcg 5460 

atggcaggga gatttcaggt gatcttaatg attactgtaa cctgcatcca acaaatcctt 5520 

tctttagctc attacacagg ctttatggac agtgaattat ctgtgggaca ataataaaat 5580 




acccafcgttt ttaacccttg tggttttacg tttttttatc ttaataagag tcaccgaact 5640 

tttccttaac ttttatatca ccagaaattg gattttgcca ataatgtcca gctgtccttg 5700 

actttagcag ccctatcttt gggcctttgg tggacatttg atcgttccag aagtggcctt 5760 

gggctgggga tcaccatagc ttttctagct acgctgatca cgcagtttct cgtgtataat 5820 

ggtgtctatc agtatacagt aagtgtgtgt tttcaaatat tggctttgga aagcttagat 5880 

attttcatat tttatttgtt ttttatatag aatgatacag atttaggcat gaaattctca 5940 

aaaatgctgc tatccataac ttaatgtaac gcaaaggaaa accaagtagt agcagttaca 6000 

accaaacaaa tatgaacatt cgtaacattg gatggttgat atgcgttctg catatcccca 6060 

ctacagagaa tctgtgatgt agaattgagc caggtgaaat tgactgctaa ggtacagttt 6120 

cttttctcgc tccaggtata cgtccccaga tttcctctat attcgttctt ggctcccttg 6180 

tatatttttc tcaggaggcg tcacggtggg gaacatagga cgacagttag ctatggtaag 6240 

tgaaatgatc atattatctt ctaaaacttg cttctcttta ccttgataga atgactttac 6300

atgatacatt caaatttgcg ttcatatatt ctggttcaga cttgagtgac aactactgag 6360 

aaaagcttgg agtccggatc aagcaacctc ccgcttcagc gtcccgagta gctgggatta 6420 

caggcacatg ccaccatacc cggctaactt ttttgctttt tgtggagatg ttatctccct 6480 

gccttgtcca ggccgggctc gaactcgtgg gctcaagcca tcctcccacc ttggcctccc 6540 

aaagcactgg gattataggc gtgagcctgg ccttgctttt tttaatatta aaaatctggt 6600 

cggatggtcc cacatggaaa tagtatgttg ttttcatgca ttcgccatca tccccgttaa 6660 

tagtcttccc aaaggagagc tggcaacctc catcattgct ttcttgcttt gagatttttg 6720 

tttcgggaac tctctttggg agagaattcc ctaagaaggt ttttaaagaa taaaattgtc 6780 

ctcaaccaga ttgactaatt acacattata actacaactt gaataacatt taaagagttg 6840 

ggttatttct tgtttgatag gttggatgct tgttatatgt caagcctttt taaatcggtt 6900 

ttaagaaata agcattattt tggtggaggt agaaatttca caacattcat aagttctaaa 6960 

catctagaaa tggttggagg aaacattttc gttttgaaga tagtttcatt aagtaatccc 7020 

tgctgtgctc ctttgcttac agaaaaaaat gttaaagata actccccata ggatagttgg 7080 

tgtaattata attctcctta cttcagcttt ctcttcatcc tttttattct ttaagcttgg 7140 

gggaaaatgt ttggatgttc tcgtatctta aaactggagg agagaggctg ggcgcggtgg 7200 

ctcacgcctt taatcccagc actttgggag ctgaggcggn cggatcacga ggtcaggaga 7260 

tcgagaccat cctggctaac acagtgaaac cccgtctgtc ctgaaaatcc ccgaaaggaa 7320

aaaaaaaata cccaggcgtg gtggtgggca cctgtagtcc cagctgcttg ggatgctgag 7380 

gcaggagaat ggcgtgaacc cgggaggcag agcttgcaat gagccgagat tgtcccactg 7440 

cattccagcc tgggtgacag agcaagactc tgtcgccgaa aaaaaaaaaa aaactggaga 7500

agagagatca ccaccttaat ccccttccct actgggctat cacatgaaac tgagctctga 7560

ggttctaagc ctaacccgcc taccactgtt tttattcata aatacttaaa tccttacctc 7620

tggatcccaa ctgttactct ttttttcact tcaaaatttg caaactcatt tctatccatc 7680 

aaaccccgga aatggcccag gggcttcaca gtgcagtccc ctccctcctg ctgcatccgt 7740 

tcctcaggct gtgccccagg ttgcattcgt ggtgcgtgtt ttctcccagg gcagccttcc 7800 

ccacattcca ctcttgtcat actgttgact cacaaaatgg aagtggctga atgttccagg 7860 

gatagtgtct aagagctatt ggtttaaggt tgaaacctac accaagtaag gctttgtgaa 7920

gggagatgac gagagtgaag ttgttgacag tctgaaggtg aagagattcc gagggtcagt 7980

tcagcaccgg gaccagaccc gcccagatcc agggtaattt gaggtctgtg tttgggtgct 8040

gatggttttg gccagtggac tccacaacct ctccttctta gctgtctcag gtctcttcag 8100 

atgccttggg tcttaactta tgcttccctc ccttaagaag aaacatagtt aattggcaag 8160 

aataaccatg ctcaaaaagt tacaagttgg gtttgggttc aagttatatg ggcttcctag 8220

cttttctatg tggtatttcc ccccacacat gccgttgtcc tttgtgtccg aaaatacagc 8280 

agaaacattt gttatcagcc tgttacggtt tttttgtttg cttgtggttt gtttttattt 8340 

gtactttttt ggagacggag tcttgctttg tcacccagac taaagtgcag tgggacaatc 8400 

ttggctcact gcatttgaac cgtctcccag gttcaaatga ttatcctgcc tcagcctccc 8460 

aagtagctag ggatcacagg gcgcccacca ccgcacctgg ctaatttttc tgtttttagt 8520

agagacgggg tttcgccacg ttggccaggc tggtctcgaa ctcctgacct caggtgatcc 8580 

acctgcctct gcctcccaaa gtgctgggat tacaggaatg agacaccgcg cccggcccag 8640 

tctgtatgtt taaaaaggaa aatgagtctg taaccatcat ttattagaac ttgtagtatg 8700 

ttgctgtgtg tagcttcaag tccttcctca agagtaagct gcaaactgcc aacagaaatc 8760 

aagcctgtcc tgttgacact gcccggcccc actgggacac ctggagggag agtccccccc 8820

acctcagtgt cagattggaa tttgatttgg cggtcctgtt tgtgattgac ccaatacagc 8880 

tgctaatgat aataatacta ttattgttat tgagggttta catttgttgg gctcttactg 8940 

ctgcacagga ggcattgtgc tgagcgtttc cattgttagg cagtttagtc ctcatacaca 9000 



agtctttgga gtagaaacta ttattagccc atttctaaaa ggggaattag cagcttagga 9060

agattaaaca gcttgaccag gtgggtggca gaggtcgacg ttgcaagccg acttctcatt 9120

agcataccac actgccaata ccaacacaat gctcgcactg tctcacctgg aagcaggtta 9180

cagcagcctg cgttggtgtc tgggttgcca tttacaatgt gccaccatta ggcaacactg 9240

ctggaagtcc ctgtgccacg tgtcaggtcc gccccggcct ggctgtcgcc cgcctctggc 9300

gtctctgcca ggcctcactg gggtctctgt cgaccaggga ttctaaaagt ccacttccca 9360

tctatttatt cagtacatta gggaacaggg agtcaaattt ttgttggccc aagtttcctt 9420

tcccgggttt gcttgcccaa gaagtttgcc cgttttttgt tttccccccc cccacggttt 9480

gaataagaac cccctcccca gttgtgcaaa aacccaagaa acttgggcag ttccttttca 9540

cctttgttcc aattcccttg aaagcattgt attgtttccc taaggaagtc cttaattgtg 9600

ttttgtcccc caggggtagt accctggcaa taccctggtt gactccaacc cagactcatg 9660

actttcttac ctgccctggt tttgttcaag atgagaggct cttttggcct gttttctgga 9720

tgagtaatct ccgtcattgc tgaagtgagg agctcagctc cttcctctta aagtgagtgg 9780

agggcaatta agtaacgtgg gaagtcacga ttctgaggga aaaaaattct gtatggagtt 9840

tctcttttgc cctttagcag aagctccgtg acctcaggaa ggtcatttaa tctctctgtc 9900

tttgcttctt catcagtgaa atactgttct ccattaagaa tttaccctga ggtgcctggt 9960

catagtagta actgtagtgt ctacctctca gaagctctgt gagggtcaga tgaggaagaa 10020

catgtagaaa ggatttccca actgtgaagc tctgtgtgaa ttgagggcac acagagcttc 10080

agtattacaa agcatgacct atttccaatg tctgtttcaa caaactgctg gtaaggtggg 10140

ttataacaca ggcatcagat actaataaag gtttgtggca gaaacctccc ttcagataaa 10200

ggttcaatcc cttaaattgc agttagaact ttcaggaacc tatcataccc taagcatcac 10260

catgctttga taaaattggg tgtttccttt atatagatat taatttagtt aaactttcac 10320

caagtcttac acttacatgg tggctttaat aagggcctca gcagcgtaat ttcacctgag 10380

gcagagaatt atcctaattt tgcaggtgag agctgtggct cacggggcca gagccatgaa 10440

gacggcgcag gggagctggg agagctggcc tcggtctcca gttctctgca catgtcccac 10500

ctctcaatct tgttgcttca gtcaaagaaa gtgtcaatat ttcccatgtt aggacctact 10560

taattggtaa ttaaaatatt tatacattta gtgctaattt tctctttcct tttttttttc 10620

ctttctacac atagggtgtt cctgaaaagc cccatagtga ttgagtcttc aaaaccaccg 10680

attctgagag caaggaagat tttggaagaa aatctgactg tggattatga caaagattat 10740

cttttttctt aagtaatcta tttagatcgg gctgactgta caaatgactc ctggaaaaaa 10800

ctcgtcacct atctagaaaa gtcaagaata gggaggtgga gaatgatgac ttaccctgaa 10860

gtcttccctt gctacccaca ctggcgcctg tctgtgccct ggagcattct gcccagccta 10920

cgtgggttca gtcaggtgcc accttcccaa gtattcgatt tcattcatgt gattaaaaca 10980

agttgccata tttcaaagcc ttgagctaag actcaattac caacccgcag ttttgtgtca 11040

gtgcccaaag gagggaggtt gatggtgctt aacaaacatg aagtatggtg taataggaat 11100

aatatttatc caaaagattt ttaaaaatag ggctgtgttt aaagaaggaa tcaaaacaag 11160

aaaagcagca gtgattatag agaggtcaca ctctaagtgg ggtcgcggcg tggccacgct 11220

tcacggtcac gctcgtccgt cctgcagtgg cgtgtttaca tggtcacacg tgtgtgtatc 11280

accagtgggt caactgcttg tcattcctcc cgcggcagtg ttgtgtagac aatcttactg 11340

agcaaaaggc aatgaaaagt cttgggctcc cacactgcga tatattggaa ttcccacctc 11400

agtttatgaa gtttatttcg aaatccatag tcatctaaga atgaatacct gtctgccatg 11460

tatttcaatc ttagtgagcc aaaattgttt gtttgttact acagaataga gatgactgtt 11520

ttttggcaca gccctatgga atttgcaatc tgtgattgcc ttgtaaaaag gagagtgcat 11580

atggcactgc attaaacgtg tggtgtttct agtcaatgat attggtgagc acaatgtatt 11640

catttaatgg catagaccat accagaccta atttgcaagt attgggtctt caaacttcaa 11700

gtgcaatgta ttatgaaaac caatctgagc cttgtatctc ttaaatattt attccttcta 11760

acgtgtgaga tgtcccgaga gaagggttct ccattcattt cagtgctgcc tggaggaaac 11820

tcggcaatga tttcttcagt tgtgaagttc ctttcgggtt acaacctcca ctggaaccct 11880

caaccttcga aatactccag ttttgggggt tggggccatt tacttataaa tttaccgccg 11940

ggtttttgga atctacatgt cttgggggcg ggctcaaatt cttcgaaagt ggttggatta 12000

aaa 12003



yy35b09.s1 Soares melanocyte 2NbHM Homo sapiens cDNA clone IMAGE:273209 3',
mRNA sequence.



ncagcatttt tcggcctctt tatttagaac ccggcggacg aggggccggg gcagtggtac 60
agacggctca ggalccattt taacagactt gtcttcaagt ttcagataaa cacagtcata 120
a?aaSgaga cagcgaaanc cgaagagact gcaagctaga tgggcatgta tggcagctac 180
agcSSgag tgaccccctt ccccagagtc cgcgatgaaa ataaagttac acttgtcaat 240
alccala?g? gggagatgga gagtgccttt gnantaacca -taaccgagc tagtgcgtgg 300
cagagcggtc cacgccttgg acataaatag aaaatataag ttagtataac ^ttaaaaact 360
ttttgtacaa atatacatgg tttttttant ttttecnttt ttttttcctt tttccttttt 420
ttgcactgag tttcagcaga gattaaacat tttatat 457



s ======= 



SS^r 9 ' accca 9999 t gsggggtgga g 9 cggctcct gcgatcgaag gggacttqac 

i S5S 555S5 SSS 5S55 £S 
II SSS M2 S2SS 2232 S 
£ E5 5=5 =5 = S6 

! tatcactgac tgatgaaaat gctctttctg gaaatgagga actaacaotr 
accagagSo tSSZ?* SWect* ctgcatgtca LgacISgg Jg^ggStg 
ttaaaclaaa E2T aaaccttggt accatagcca aatctgggll aagcgagtt? 

g£i SHS i™ ~ SS £55 

ZZZSSS a *™*° M «9«9999« acgacaatt. cecttgtctt alaagaaSa 

K2E SSS =SS2 2SS£ «S 

S HiS iPFF = =£ 3 
sb£ £55 S? =sa 5 =5 
safe i S S= =sss 5sss 

^ a 9tgx:ai: aaaaataaaa aagatcccaa ataaaaaaaa aaaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1381 



Homo sapiens tumor suppressor deleted in oral cancer-related 1, mRNA (cDNA
clone MGC:3779 IMAGE:3659410), complete cds.

/translation="MSYKPIAPAPSSTPGSSTPGPGTPVPTGSVPSPSGSVPGAGAPFR
PLFNDFGPPSMGYVQAMKPPGAQGSQSTYTDLLSVIEEMGKEIRPTYAGSKSAMERLKR
GIIHARALVRECLAETERNART"

gcgcgcaagg caccggtggc agcggcgacg gcagctgcga cagcaacccc tgctgggccg 60 

aaactgggca gagcggagca gacgtctgaa gcagcgcgag tgaggcgcga gggtagcgcc 120 

cgcgcccggg aagacccctc ggcgcgaacc ggcagcccag ccccgggtcc cggttcccaa 180 

ggccccgcct ctagggcctg gggactaatc ggattgagag cgcgccggcc cgggccgcga 240 

actcgccaat tgcggagggc ggtggccacc gcccaatccg gagcagacag gtgcgaggtc 300 

cggaaggcgg aggccaatcg gcggcggttg cgacctgctg gggcaggtct cggccaataa 360

ggaggctcga gtgacatctt cgcgcaccaa tcgggagtga gggagcattc gtgcccgctc 420 

gcccttccgg ccagacctct atttaccagg ggcgtgcagc ccgcttgcca atcagagcgc 480 

ggctgagcgg ccccgcagcc aacccccgag gagcggccgg ctggcgtccg ccgcgcccag 540 

gagttgggga tgtcctacaa acccatcgcc cctgctccca gcagcacccc tggctccagc 600 

acccctgggc cgggcacccc ggtccctaca ggaagcgtcc cgtcgccgtc gggctcagtg 660 

ccaggagccg gcgctccttt cagaccgctg tttaacgact ttggaccgcc ttccatgggc 720 

tacgtgcagg cgatgaagcc acccggcgcc cagggctccc agagcaccta cacggacctg 780 

ctgtcagtca tagaggagat gggcaaagag atccggccta cctatgctgg cagcaagagc 840 

gccatggagc gcctgaagag aggtatcatc catgcccggg ccctagtcag agagtgcctg 900 

gcagagacag agcggaacgc ccgcacgtaa caggaagcgc ctcggcctca gcgtctggac 960 

ctatccggcc actgcagagc acccgcttct ccctggcctt catcccgagt tgcactaacc 1020 

atcctgggct tcctgtcctg tgtcccttgg tgggtcccct ccaggaacca aggagtggcc 1080 

ctccaggtgg cagcactaag gacacccccc cacaacaaga gttagcagcg aggtccccat 1140 

gagtcccacc catgacctgc cgacagtgtt gcccaccgga acttttgtgg cccctaccgc 1200 

tcagcccttc ccagcacttc tcccactttg tcccgagcct ccttctcccc cagcaggggc 1260 

acaggcctgg cacctccctg ccttgtgtcc tgagccatag tgactctttt atctgtgtgt 1320 

cttttgctaa atatgccctt tttatattaa taaaagatga tttggagttg tgctctcaaa 1380 

aaaaaaaaaa aaaaaaa 1397 



Homo sapiens TNFR-related death receptor-6 (DR6) mRNA, complete cds.

/translation="MGTSPSSSTALASCSRIARRATATMIAGSLLLLGFLSTTTAQPEQ
KASNLIGTYRHVDRATGQVLTCDKCPAGTYVSEHCTNTSLRVCSSCPVGTFTRHENGIE
KCHDCSQPCPWPMIEKLPCAALTDRECTCPPGMFQSNATCAPHTVCPVGWGVRKKGTET
EDVRCKQCARGTFSDVPSSVMKCKAYTDCLSQNLVVIKPGTKETDNVCGTLPSFSSSTS
PSPGTAIFPRPEHMETHEVPSSTYVPKGMNSTESNSSASVRPKVLSSIQEGTVPDNTSS
ARGKEDVNKTLPNLQVVNHQQGPHHRHILKLLPSMEATGGEKSSTPIKGPKRGHPRQNL
HKHFDINEHLPWMIVLFLLLVLVVIVVCSIRKSSRTLKKGPRQDPSAIVEKAGLKKSMT
PTQNREKWIYYCNGHGIDILKLVAAQVGSQWKDIYQFLCNASEREVAAFSNGYTADHER
AYAALQHWTIRGPEASLAQLISALRQHRRNDVVEKIRGLMEDTTQLETDKLALPMSPSP
LSPSPIPSPNAKLENSALLTVEPSPQDKNKGFFVDESEPLLRCDSTSSGSSALSRNGSF
ITKEKKDTVLRQVRLDPCDLQPIFDDMLHFLNPEELRVIEEIPQAEDKLDRLFEIIGVK
SQEASQTLLDSVYSHLPDLL"

atggggacct ctccgagcag cagcaccgcc ctcgcctcct gcagccgcat cgcccgccga 60 

gccacagcca cgatgatcgc gggctccctt ctcctgcttg gattccttag caccaccaca 120

gctcagccag aacagaaggc ctcgaatctc attggcacat accgccatgt tgaccgtgcc 180 

accggccagg tgctaacctg tgacaagtgt ccagcaggaa cctatgtctc tgagcattgt 240 

accaacacaa gcctgcgcgt ctgcagcagt tgccctgtgg ggacctttac caggcatgag 300 

aatggcatag agaaatgcca tgactgtagt cagccatgcc catggccaat gattgagaaa 360 

ttaccttgtg ctgccttgac tgaccgagaa tgcacttgcc cacctggcat gttccagtct 420 

aacgctacct gtgcccccca tacggtgtgt cctgtgggtt ggggtgtgcg gaagaaaggg 480 

acagagactg aggatgtgcg gtgtaagcag tgtgctcggg gtaccttctc agatgtgcct 540 

tctagtgtga tgaaatgcaa agcatacaca gactgtctga gtcagaacct ggtggtgatc 600 

aagccgggga ccaaggagac agacaacgtc tgtggcacac tcccgtcctt ctccagctcc 660 

acctcacctt cccctggcac agccatcttt ccacgccctg agcacatgga aacccatgaa 720 

gtcccttcct ccacttatgt tcccaaaggc atgaactcaa cagaatccaa ctcttctgcc 780 

tctgttagac caaaggtact gagtagcatc caggaaggga cagtccctga caacacaagc 840 

tcagcaaggg ggaaggaaga cgtgaacaag accctcccaa accttcaggt agtcaaccac 900 

cagcaaggcc cccaccacag acacatcctg aagctgctgc cgtccatgga ggccactggg 960 

ggcgagaagt ccagcacgcc catcaagggc cccaagaggg gacatcctag acagaaccta 1020 

cacaagcatt ttgacatcaa tgagcatttg ccctggatga ttgtgctttt cctgctgctg 1080 

gtgcttgtgg tgattgtggt gtgcagtatc cggaaaagct cgaggactct gaaaaagggg 1140

ccccggcagg atcccagtgc cattgtggaa aaggcagggc tgaagaaatc catgactcca 1200 

acccagaacc gggagaaatg gatctactac tgcaatggcc atggtatcga tatcctgaag 1260 

cttgtagcag cccaagtggg aagccagtgg aaagatatct atcagtttct ttgcaatgcc 1320 

agtgagaggg aggttgctgc tttctccaat gggtacacag ccgaccacga gcgggcctac 1380 

gcagctctgc agcactggac catccggggc cccgaggcca gcctcgccca gctaattagc 1440

gccctgcgcc agcaccggag aaacgatgtt gtggagaaga ttcgtgggct gatggaagac 1500 

accacccagc tggaaactga caaactagct ctcccgatga gccccagccc gcttagcccg 1560 

agccccatcc ccagccccaa cgcgaaactt gagaattccg ctctcctgac ggtggagcct 1620 

tccccacagg acaagaacaa gggcttcttc gtggatgagt cggagcccct tctccgctgt 1680 

gactctacat ccagcggctc ctccgcgctg agcaggaacg gttcctttat taccaaagaa 1740 

aagaaggaca cagtgttgcg gcaggtacgc ctggacccct gtgacttgca gcctatcttt 1800 

gatgacatgc tccactttct aaatcctgag gagctgcggg tgattgaaga gattccccag 1860

gctgaggaca aactagaccg gctattcgaa attattggag tcaagagcca ggaagccagc 1920 

cagaccctcc tggactctgt ttatagccat cttcctgacc tgctgtag 1968



601848574F1 NIH_MGC55 Homo sapiens cDNA clone IMAGE:4079202 5', mRNA
sequence.

acaatggtat agatttcaca acacaaaaag gacattggtg gatgttactg cacattttaa 60
attcttaaca ctaatttatc tgtataagtg tttatatgca tattttggga cataaacagt 120
ttatgtaaaa ttagtaatga atgatggcaa cgagggcact gttatcttcg ttgttttcaa 180
tgatcattta gcattcaatg atggaacagc tggtataaca taagtggtcg gcatgaaata 240
tttgagatcg aaacttctgt gccttgaaca gaacttatat cttagattct ctctcacatt 300
ttctgtggag ctggggttga ataggaacca gatgatgttc actgctgaaa ttccataatg 360
cttcccattg aagggaagtg agaaccagga aagctgcttt cacgtcatgt gccatccagt 420
actgacaggg aagaaagatg tagttttcca gtagtgatga atcacattat gaattacatt 480
tcttcttaag aagtaaaaac tcagaatgta ccatctgtgt ttcctttcag ttcattaaat 540
ggcatcataa cagatgactt gtgctaagtt caatagagtt accacatctt ttactattat 600
gcaaaaatat taactttaat gaaccattgc ttggacatga tttcctatac attaccattg 660
ggccgaatgt gttggtcata ctatcacgca ctaaacctgg gtgtttacac tgggcaccgc 720
gcttcaccgg gcataaggcg gacaacggtc ttaggcaaac tcgggtcctc gaaac 775



DE Homo sapiens clone PP1722 unknown mRNA. 



/translation="MQYLAATAASGAFVPPPSAQEIPVVSAPAPAPIHNQFPAENQPAN

QNAAPQVWNPGANQlHiRMNAQGGPIVEEDDE INKDWLDWTYSAATFSVFIiS ILYFYSS 
LSRFLMVMGATVVMYLHHVGWFPFRPRPVQNFPNDGPPPDVVNQDPNNNLQEGTDPETE
DPNHLPPDRDVLDGEQTSPSFMSTAWLVFKTFFASLLPEGPPAIAN"






Sequence 2217 BP; 612 A; 460 C; 463 G; 682 T; 0 Other; 

° Caggc ^ ttt: ctcaaactcc tgagggcaag cgatcctccc acctcagcct 
cctgagtagc tgggactaca ggcatgtgcc actagacctg gctctaaaga catatatgac 
acacgaaacc atttattttt catttcacaa tgtttattca catatatggt attagtaftc 
taatgtagtg atgcactcta aatttgcatt atatttccta gaacatcga acagagcata 
ggaaattccc tattttgcca ttatcagttc taacaaaaat cttaaaagca ctt?a?catt 

9»«*Staat ttttttaaat gatcaaaaac agtateaLc caaggcttac 
tcaaatS at3C * att " «9— gttgt gggctgggtt gtatttataa .tcSgttgg 
^^ 9at S Ct 9caatgagta aatttagcac cattatcagg aagctttctc accaatgaS 
222"! aa9a " ttaa tgaaagtgta gcatactcta gggaaaaaat atgaatfttt 

9tattgaaaa ttatgttgaa taaatgtcag actatttttt acltaacgtt 
^" 9tCa c 9 ttca 9 a 99 tggggggtag gagatgtaag cccttgacag 

"" tt9 r t gatttca 9^ agttgcatca gctcctttgt tctgtgttc! 
tgttacactt atttaggtgg ctgaatccac agaggagcct gctggttcta atcggggaca 

SSSS ? a o tCCtCaa St r t99ttt aaggcaaa 99 gaag?tc t tc g^SSS 
;S S tgggaaaaca tctcaaggcc tgaagctgcc cagcaggcat tccaaggcct 
?! 9 ^!! 99t ttc ^ggtt acacacccta tgggtggctt cagctttcct ggttccagca 
?2S a S« C9aCa ? tact a catgcaata tttagcagcc actgctgcat clggggcEtt 
tcacaaccao ?* aagtgcac aa 9 a 3atacc tgtggtctct gcacctgctc cagcclctat 
tlltttl^f * ttcca 9 ct 9 aaaaccagcc tgccaatcag aatgctgctc ctcaagtggt 
aaaaaato^ 9gag * Caatc aaa *tttgcg gatgaatgca caaggtggcc ctattgtiga 
a f aa ? a ^ 9at 9aaataaatc gagattggtt ggattggacc tattcagcag ctacatt??c 
aaccacca^ S??* acttctactc ctccctgagc agattcltc! tggtcatggg 
?TJ,lt,??r. 9ttatgtacc tgcatcacgt tgggtggttt ccatttagac cgaggccig? 
tcagaacttc ccaaatgatg gtcctcctcc tgacgttgta aatcaggacc ccaacaaSa 
cttacaggaa ggcactgatc ctgaaactga agaccccaac cacctccctc cagacaggga 

gSttS?^ ^ a9CCCCtC ctttat 9 a 9<= acagcatggc ttgtct?caa 

Itllllt gcctctcttc ttccagaagg ccccccagcc atcgcaaact gatggtgttt 
llntl^l tgtt99a 99C tttgacagga atggactgga tcacctgact ccagftagat 
tgcctctcct ggacatggca atgatgagtt tttaaaaaac agtgtggatg atgatatgct 
tttgtgagca agcaaaagca gaaacgtgaa gccgtgatac aaattggtga acaaaaaatg 
aaacaSS ct * a t3tc tt tattctgaag agctttaata tatacfctat gtagtJtaa? 
Sf^SS 3 = gta ? aa 99c cttaggtgtt gcatgtctat gcttgaggaa cttttccaaa 
tgtgtgtgtc tgcatgtgtg tttgtacata gaagtcatag atgcagaagt ggttctgctg 
ZSSSSi a " CCt9 " g 9aatgt t taa attacactaa gtgtactact SatatLtc 
aatgaaattg ctagacatgt tttagcagga cttttctagg aaagacttat gtataattgc 
tttttaaaat gcagtgcttt actttaaact aaggggaact ttgcggaggt gaaaaccttt 
gctgggtttt ctgttcaata aagttttact atgaa?gaoa aaLSaSa faaataa 



60
180
300



Homo sapiens hypothetical protein FLJ11259, mRNA (cDNA clone MGC:8787
IMAGE:3925141), complete cds.

/translation="MGIVANFQELAVPVVHDGGALLAFVCGVVYTLLQSIISYKSCPQW
NSLSTCHIRMVISAVSCAAVIPMIVCASLISITKLEWNPREKDYVYHVVSAICEWTVAF
GFIFYFLTFIQDFQSVTLRISTEINGDI"



Sequence 2388 BP; 725 A; 460 C; 523 G; 680 T; 0 other; 

gcaaaatcaa acctgctatt tcagcactcc tgtttttaac ttggtgtctt tagtgcttgg 60 

attggtggga tgtttcggaa tgggcattgt cgccaatttt caggagttag ctgtgccagt 120 

ggttcatgac gggggcgctc ttttggcctt tgtctgtggt gtcgtgtaca cgctcctaca 180 

gtccatcatc tcttacaaat catgtcccca gtggaacagt ctctcgacat gccacatacg 240 

gatggtcatc tctgccgttt cttgcgcagc tgtcatcccc atgattgtct gtgcttcact 300

aatttccata accaagctgg agtggaatcc aagagaaaag gattatgtat atcacgtagt 360 

gagtgcgatc tgtgaatgga cagtggcctt tggttttatt ttctacttcc taactttcat 420 

ccaagatttc cagagtgtca ccctaaggat atccacagaa atcaatggtg atatttgaag 480 

aaagaagaat tcagtctcac tcagtgaatg tcgcaggcca tttctaaaag tgctacagag 540 

gacagacagg gttttgaggc caccctgatt attgggatgc atctgcagca catccaggac 600 

ttgaatttca ttacgagttc ctaatagttg tatttctaaa gatgtttcct agagaatgta 660 

cagccttatg acactgtagt gatgttttta taattttcta agtagatttt tttatattaa 720 

caaattcata tacagaaaaa ataaggtgtt acaaaaaatg gagagctctt atttttgtac 780 

agattctgtc gttttttttt tatttgtgtg agatttatgg aaatacacta aatgagtaat 840 

tcaggttcag tacatttatt acaaagtgaa atcaggggat attcatttgt aaattttatt 900 

cttagtgaat gaactgtata atttttttta tcaggagagc acttataaaa ttcaatttat 960 

aaagatcata tacccaaatc ataaagattt agttgataca ttaacactaa gatactctga 1020 

tttttagcca aactaaacaa agtgcttcta ctgagaggcc tttataccac catgtacagt 1080 

aactctaagt gaatacggaa gaccttggtt ttgaaattct gccaccttgt ttctccctgc 1140 

tcatgaggtc gcaccttttg ctcttgctgc taattgccca ttcttagtgg gtgtaatgcc 1200 

aggtggaatg gtttcaacaa gtcaggtgaa aaccatcctt tattgttgct ggcacaactt 1260 

gatatatagt ctgactcaga actgaagctc acatctcaaa ttcatttcat gccagtaaat 1320 

gtggcaaaga gaagaaaggc ccaagagcga gacaagaaga atggagaagg gggcagccaa 1380

gaagaacttc tgggttcagg gtactgttta tttgctcctt ctcttcatgc ctgtggctgg 1440 

atgtcccaca acactataag aaatataagt caagcccttt gtgttaagca agaactacag 1500 

actccatctt ttcacccaaa tcatgaatga ccaataaaaa gcaagttatt ccagaggaag 1560 

aagcagccct tgaaaaataa ggcttaggct tgaaaggtga agagcaggaa ttctctcttt 1620 

caaatcctag agcataaacc catgtgtggc caagtgagat cagccctcaa gggcacatgc 1680 

caagggcaga gcagcccatg tagacagctt cggagggcat gggggtgtag ggagttcggg 1740 

gtagctcctc attaactatt tgttgggtga gtaaaggggt gaggctcagt ggcaggtacc 1800 

tctgcaatga caagctgcct cccctctatg tgtttagcat atgttattag aacatgtccg 1860 

acacccctac cgctgccatt tgggcccttt aataaagcca agtagagaaa tctggcaata 1920 

aaaggcaaat gtaagcatgc tttctttaag acgcatcata aatggttttc tttaagtgaa 1980 

tggaagagtt tgacagagat acacctttgt aagaaaacat taagaatgct ggctggctgt 2040 

ggtggctcac acctgtattc ccagcacttt gggaggccta ggcaggagga ttgcttgagc 2100 

ctgggacttc gagaccagac tgggaaacat ggcaaaatcc catctctaca acaaaaatac 2160 

aaaaattagc caagtgcggt ggtgtgcctg tagtcctagt tacttgggag gctgaggtgg 2220 

gagaatcacc tgagcccagg aggtggaggc tgcagtgagc catgccaatg cactccagtc 2280 

tgggcaacag agtgagaccc tgtctcaaaa ataaataaat aaataaatga ataaagagaa 2340 
tgctaatcca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 2388



tq65c10.x1 NCI_CGAP_Lu19 Homo sapiens cDNA clone IMAGE:2213682 3' similar
to SW:ENPL_HUMAN P14625 ENDOPLASMIN PRECURSOR ;, mRNA sequence.



ttttttttcc tctactgcag cttcatcatc agattcttct ttctcttctt tggctgcttc 60
ttcttcctcc atgggctcct caacagtttc agtcttgctg ctccatacat aaataggaaa 120
gtttatgaac tgtgaatatt tttttgacga gatttttaat tgtatccaat tcaaggtaat 180
cagatgcttc ttcttttaag acaagggtaa ttgtcgttcc ccgtcctaga gtgtttcctc 240
ttgggtcagc aattacagaa aattcattgg agtcagactc ccagatgtgc tgagtatcgt 300
tgttgtgttt tgaagtgaca ataaccttat ctgctacaag gaaggcggaa tagaaaccga 360
caccaaactg gccaatcaat tcagaagttg actggccatc ttcctgtgct tcagtcattt 420
tgtttaaaaa ctcgcttgtc ccagatttgg ctatggtacc aaggttttta accaactctt 480
ctcggtcatt cctacaccgn tgtctgtgac atgcagcagg ttcttctcct tatcacactt 540
aattttgact tgtagttcct catttccaga aagagcattt tcatcagtca gtgatattag 600
ccttatctta tctaaagcat caaaagcaat tgaaatcagt tctctcaaga aaatctcttt 660
atttttatac aggaattggt gataggttca tcattctggt aactccgctt ggaaggcaac 720
ttttccgact ctctctaagt ctctaattgg gagcattaaa tcatcaactg atagcttctt 780
ctttctcgga ctacttcata tcccggcctt gactttctta cttttcccca aacccttttc 840
ttgcccatcc cataacttaa ttgcagcttg accgaccgaa gtaanaggac ccaaaggccc 900
aacccccagg cccctttggg tgcgggcgaa attaatacct ctaatcaggg cccccttggc 960
caatttgccc gggccaaatc ttattggggg ttaaaaaaaa attttattgt ttggggaaag 1020
ttcccccatc cccaaaaacc ccggaaaagg gaaggggggc gttaggggaa caatattggc 1080
tcctccctcn cccaaaancc ccgcctatta aaacccggga gggaaangtn ttccctctcc 1140
tctcaccccn c 1151



Homo sapiens phosphoserine aminotransferase (PSA) mRNA, complete cds.

/translation="MDAPRQVVNFGPGPAKLPHSVLLEIQKELLDYKGVGISVLEMSHR
SSDFAKIINNTENLVRELLAVPDNYKVIFLQGGGCGQFSAVPLNLIGLKAGRCADYVVT
GAWSAKAAEEAKKFGTINIVHPKLGSYTKIPDPSTWNLNPDASYVYYCANETVHGVEFD
FIPDVKGAVLVCDMSSNFLSKPVDVSKFGVIFAGAQKNVGSAGVTVVIVRDDLLGFALR
ECPSVLEYKVQAGNSSLYNTPPCFSIYVMGLVLEWIKNNGGAAAMEKLSSIKSQTIYEI
IDNSQGFYVSVGGIRASLYNAVTIEDVQKLAAFMKKFLEMHQL"

ccttggctga ctcaccgccc tcgccgccgc accatggacg cccccaggca ggtggtcaac 60 

tttgggcctg gtcccgccaa gctgccgcac tcagtgttgt tagagataca aaaggaatta 120 

ttagactaca aaggagttgg cattagtgtt cttgaaatga gtcacaggtc atcagatttt 180 

gccaagatta ttaacaatac agagaatctt gtgcgggaat tgctagctgt tccagacaac 240 

tataaggtga tttttctgca aggaggtggg tgcggccagt tcagtgctgt ccccttaaac 300 

ctcattggct tgaaagcagg aaggtgtgcg gactatgtgg tgacaggagc ttggtcagct 360 

aaggccgcag aagaagccaa gaagtttggg actataaata tcgttcaccc taaacttggg 420 

agttatacaa aaattccaga tccaagcacc tggaacctca acccagatgc ctcctacgtg 480 

tattattgcg caaatgagac ggtgcatggt gtggagtttg actttatacc cgatgtcaag 540 

ggagcagtac tggtttgtga catgtcctca aacttcctgt ccaagccagt ggatgtttcc 600 

aagtttggtg tgatttttgc tggtgcccag aagaatgttg gctctgctgg ggtcaccgtg 660 

gtgattgtcc gtgatgacct gctggggttt gccctccgag agtgcccctc ggtcctggaa 720 

tacaaggtgc aggctggaaa cagctccttg tacaacacgc ctccatgttt cagcatctac 780 

gtcatgggct tggttctgga gtggattaaa aacaatggag gtgccgcggc catggagaag 840 

cttagctcca tcaaatctca aacaatttat gagattattg ataattctca aggattctac 900 

gtgtctgtgg gaggcatccg ggcctctctg tataatgctg tcacaattga agacgttcag 960 

aagctggccg ccttcatgaa aaaatttttg gagatgcatc agctatgaac acatcctaac 1020 

caggatatac tctgttcttg aacaacatac aaagtttaaa gtaac 1065 



IE 



Homo sapiens cDNA clone: ADBAPE04, 5'-end, expressed in human adrenal gland.



aaagaaactg gttggtttta agaaaatagt ttcaagaagt tcaactatat tcttttagat 60
attatgtatt gttttactct gattaggtta ctgtgatagg catttattca tattctttct 120
ataccactgt cattaatata ttaaaaagat gtatgtgtta gactatcgaa agggccttat 180
tctctctttc tcatagactg accttctttt ggaatttctg agtcatttat tttccttagc 240
tttttccact caaattaagg gcaagcgaaa aagtaataat tcggcattct ttaagcctac 300
agaatgtgat tctttcactt ggttattaca ctggctcgtg gacagaacat tttgaaaagt 360
gaaagattta ttttggtaaa agattttgct ttacttttcg aagcattatt cttttaaaga 420
gtggtttact tcaacgattg aaacattttc ctattaaaat ttcattgtta gaatcacagg 480
agcgcaaaaa tggaacggtt gattgaaatn tactctttct gtgaagaaaa tcacagagtt 540
gttgcctcgt tgtagttggt gggccccgta gcatggatgc ctttgccaat gggttcatgt 600
gccacacaaa gcaaacagat ctgcatcgat cgcaatttct tgtgaacacg gattgcatgt 660
ccatatccct ttgcaggatt taaaatattt aaaatggcct gccttgagtg cgatgagcca 720
acttgcctac tggactccac ctgggtgacc aat 753



wd68f02.x1 NCI_CGAP_Lu24 Homo sapiens cDNA clone IMAGE:2336763 3',
sequence. 

tttctgtaca atacacattt attgagcact agatatatgc catgctagat gcaggtgacc 60
cagagcatca aggagcaata gtctggtggc agagacacac acaatgtcac tgtgatgtat 120
taaagcagtc agcaatagat gcagctcagg gcactgtggg gatatccaga ggcacagtac 180
cttctgcctg tcagtcaggg agggagagga gcacaggctg aaggagactg gaagacagca 240
gttggcctct gatagtggga ctggagagag atttctaagg gccacttctt gttttcaggg 300
actaggttgg gctagatatg gggctcagga tggacaaggc ttagagccag gttggagaag 360
atgaaagagc attactagag gagtggggag gcctaggcta tgctctttac tctgccattg 420
actgcgtgat cttgggcagg ccatgtaacc tctcagggct gtgcactccc ttatttgtaa 480
aactagaggg ctgggccagc atgtttt 507



H. sapiens LU gene for Lutheran blood group glycoprotein.
Lutheran blood group glycoprotein. 



/translation="MEPPDAPAQARGAPRLLLLAVLLAAHPDAQAEVRLSVPPLVEVMR
GKSVILDCTPTGTHDHYMLEWFLTDRSGARPRLASAEMQGSELQVTMHDTRGRSPPYQL
DSQGRLVLAEAQVGDERDYVCVVRAGAAGTAEATARLNVFAKPEATEVSPNKGTLSVME
DSAQEIATCNSRNGNPAPKITWYRNGQRLEVPVEMNPEGYMTSRTVREASGLLSLTSTL
YLRLRKDDRDASFHCAAHYSLPEGRHGRLDSPTFHLTLHYPTEHVQFWVGSPSTPAGWV
REGDTVQLLCRGDGSPSPEYTLFRLQDEQEEVLNVNLEGNLTLEGVTRGQSGTYGCRVE
DYDAADDVQLSKTLELRVAYLDPLELSEGKVLSLPLNSSAVVNCSVHGLPTPALRWTKD
STPLGDGPMLSLSSITFDSNGTYVCEASLPTVPVLSRTQNFTLLVQGSPELKTAEIEPK
ADGSWREGDEVTLICSARGHPDPKLSWSQLGGSPAEPIPGRQGWVSSSLTLKVTSALSR
DGISCEASNPHGNKRHVFHFGAVSPQTSQAGVAVMAVAVSVGLLLLVVAVFYCVRRKGG
PCCRQRREKGAPPPGEPGLSHSGSEQPEQTGLLMGGASGGARGGSGGFGDEC"

agtctccgcc gccgccgtga acatggagcc cccggacgca ccggcccagg cgcgcggggc 60 

cccgcggctg ctgttgctcg cagtcctgct ggcggcgcac ccagatgccc aggcggaggt 120

gcgcttgtct gtacccccgc tggtggaggt gatgcgagga aagtctgtca ttctggactg 180 

cacccctacg ggaacccacg accattatat gctggaatgg ttccttaccg accgctcggg 240 

agctcgcccc cgcctagcct cggctgagat gcagggctct gagctccagg tcacaatgca 300 

cgacacccgg ggccgcagtc ccccatacca gctggactcc caggggcgcc tggtgctggc 360 

SHS^ Cag 9t9g9Cgac 9 agcgagacta cgtgtgcgtg gtgagggcag gggcggcagg 420 

cactgctgag gccactgcgc ggctcaacgt gtttgcaaag ccagaggcca ctgaggtctc 480 

ccccaacaaa gggacactgt ctgtgatgga ggactctgcc caggagatcg ccacctgcaa 540 

cagccggaac gggaacccgg cccccaagat cacgtggtat cgcaacgggc agcgcctgga 600 

ggtgcccgta gagatgaacc cagagggcta catgaccagc cgcacggtcc gggaggcctc 660 

gggcctgctc tccctcacca gcaccctcta cctgcggctc cgcaaggatg accgagacgc 720 

'H™ ff° t 9°9fegccc actacagcct gcccgagggc cgccacggcc gcctggacag 780 

ccccaccttc cacctcaccc tgcactatcc cacggagcac gtgcagttct gggtgggcag 840 

cccgtccacc ccagcaggct gggtacgcga gggtgacact gtccagctgc tctgccgggg 900 

ggacggcagc cccagcccgg agtatacgct tttccgcctt caggatgagc aggaggaagt 960 

gctgaatgtg aatctcgagg ggaacttgac cctggaggga gtgacccggg gccagagcgg 1020 

gacctatggc tgcagagtgg aggattacga cgcggcagat gacgtgcagc tctccaagac 1080 

gctggagctg cgcgtggcct atctggaccc cctggagctc agcgagggga aggtgctttc 1140 

cttacctcta aacagcagtg cagtcgtgaa ctgctccgtg cacggcctgc ccacccctgc 1200

cctacgctgg accaaggact ccactcccct gggcgatggc cccatgctgt cgctcagttc 1260 

tatcaccttc gattccaatg gcacctacgt atgtgaggcc tccctgccca cagtcccggt 1320 

cctcagccgc acccagaact tcacgctgct ggtccaaggc tcgccagagc taaagacagc 1380 

ggaaatagag cccaaggcag atggcagctg gagggaagga gacgaagtca cactcatctg 1440 

ctctgcccgc ggccatccag accccaaact cagctggagc caattggggg gcagccccgc 1500 

agagccaatc cccggacggc agggttgggt gagcagctct ctgaccctga aagtgaccag 1560 

cgccctgagc cgcgatggca tctcctgtga agcctccaac ccccacggga acaagcgcca 1620 

tgtcttccac ttcggcgccg tgagccccca gacctcccag gctggagtgg ccgtcatggc 1680 

cgtggccgtc agcgtgggcc tcctgctcct cgtcgttgct gtcttctact gcgtgagacg 1740 

caaagggggc ccctgctgcc gccagcggcg ggagaagggg gctccgccgc caggggagcc 1800 

agggctgagc cactcggggt cggagcaacc agagcagacc ggccttctca tgggaggtgc 1860 

ctccggagga gccaggggtg gcagcggggg cttcggagac gagtgctgag ccaagaacct 1920 

cctagaggct gtccctggac ctggagctgc aggcatcaga gaaccagccc tgctcacgcc 1980 

atgcccgccc ccgccttccc tcttccctct tccctctccc tgcccagccc tcccttcctt 2040 

lllS^Jt Caa39Ca 999 acccacagtg gctgcctgcc tccgggaggg aaggagaggg 2100 

agggtgggtg ggtgggaggg ggccttcctc cagggaatgt gactctccca ggccccagaa 2160 

tagctcctgg acccaagccc aaggcccagc ctgggacaag gctccgaggg tcggctggcc 2220 

ggagctattt ttacctcccg cctcccctgc tggtcccccc acctgacgtc ttgctgcaga 2280 

gtctgacact ggattccccc ccctcacccc gcccctggtc ccactcctgc ccccgcccta 2340 

cctccgcccc accccatcat ctgtggacac tggagtctgg aataaatgct gtttgtcaca 2400 



2402 



Homo sapiens mRNA for calmegin, complete cds. 

/translation="MHFQAFWLCLGLLFISINAEFMDDDVETEDFEENSEEIDVNESEL
SSEIKYKTPQPIGEVYFAETFDSGRLAGWVLSKAKKDDMDEEISIYDGRWEIEELKENQ
VPGDRGLVLKSRAKHHAISAVLAKPFIFADKPLIVQYEVNFQDGIDCGGAYIKLLADTD
DLILENFYDKTSYIIMFGPDKCGEDYKLHFIFRHKHPKTGVFEEKHAKPPDVDLKKFFT
DRKTHLYTLVMNPDDTFEVLVDQTVVNKGSLLEDVVPPIKPPKEIEDPNDKKPEEWDER
AKIPDPSAVKPEDWDESEPAQIEDSSVVKPAGWLDDEPKFIPDPNAEKPDDWNEDTDGE
WEAPQILNPACRIGCGEWKPPMIDNPKYKGVWRPPLVDNPNYQGIWSPRKIPNPDYFED
DHPFLLTSFSALGLELWSMTSDIYFDNFIICSEKEVADHWAADGWRWKIMIANANKPGV
LKQLMAAAEGHPWLWLIYLVTAGVPIALITSFCWPRKVKKKHKDTEYKKTDICIPQTKG
VLEQEEKEEKAALEKPMDLEEEKKQNDGEMLEKEEESEPEEKSEEEIEIIEGQEESNQS
NKSGSEDEMKEADESTGSGDGPIKSVRKRRVRKD"



cgccggcggg actggtctga agagacgcgg ggacaaagtg gcaacgactt ggacatctga 60 

gctgtcactg ccgaaaacag gccgcaagag agataatcaa tatgcatttc caagcctttt 120

ggctatgttt gggtcttctg ttcatctcaa ttaatgcaga atttatggat gatgatgttg 180 

agacggaaga ctttgaagaa aattcagaag aaattgatgt taatgaaagt gaactttcct 240

cagagattaa atataagaca cctcaaccta taggagaagt atattttgca gaaacttttg 300 

atagtggaag gttggctgga tgggtcttat caaaagcaaa gaaagatgac atggatgagg 360 

aaatttcaat atacgatgga agatgggaaa ttgaagagtt gaaagaaaac caggtacctg 420 

gtgacagagg actggtatta aaatctagag caaagcatca tgcaatatct gctgtattag 480 

caaaaccatt catttttgct gataaaccct tgatagttca atatgaagta aattttcaag 540 

atggtattga ttgtggaggt gcatacatta aactcctagc agacactgat gatttgattc 600 

tggaaaactt ttatgataaa acatcctata tcattatgtt tggaccagat aaatgtggag 660 

aagattataa acttcatttt atcttcagac ataaacatcc caaaactgga gttttcgaag 720

agaaacatgc caaacctcca gatgtagacc ttaaaaagtt ctttacagac aggaagactc 780 

atctttatac ccttgtgatg aatccagatg acacatttga ggtgttagtt gatcaaacag 840

ttgtaaacaa aggaagcctc ctagaggatg tggttcctcc tatcaaacct cccaaagaaa 900 

ttgaagatcc caatgataaa aaacctgagg aatgggatga aagagcaaaa attcctgatc 960 

cttctgccgt caaaccagaa gactgggatg aaagtgaacc tgcccaaata gaagattcaa 1020 

gtgttgttaa acctgctggc tggcttgatg atgaaccaaa atttatccct gatcctaatg 1080 

ctgaaaaacc tgatgactgg aatgaagaca cggatggaga atgggaggca cctcagattc 1140 

ttaatccagc atgtcggatt gggtgtggtg agtggaaacc tcccatgata gataacccaa 1200 

aatacaaagg agtatggaga cctccactgg tcgataatcc taactatcag ggaatctgga 1260 

gtcctcgaaa aattcctaat ccagattatt tcgaagatga tcatccattt cttctgactt 1320 

ctttcagtgc tcttggttta gagctttggt ctatgacctc tgatatctac tttgataatt 1380 

ttattatctg ttcggaaaag gaagtagcag atcactgggc tgcagatggt tggagatgga 1440 

aaataatgat agcaaatgct aataagcctg gtgtattaaa acagttaatg gcagctgctg 1500 

aagggcaccc atggctttgg ttgatttatc ttgtgacagc aggagtgcca atagcattaa 1560 

ttacttcatt ttgttggcca agaaaagtaa agaaaaaaca taaagataca gagtataaaa 1620 

aaaccgacat atgtatacca caaacaaaag gagtactaga gcaagaagaa aaggaagaga 1680 

aagcagccct ggaaaaacca atggacctgg aagaggaaaa aaagcaaaat gatggtgaaa 1740 

tgcttgaaaa agaagaggaa agtgaacctg aggaaaagag tgaagaagaa attgaaatca 1800 

tagaagggca agaagaaagt aatcaatcaa ataagtctgg gtcagaggat gagatgaaag 1860 

aagcagatga gagcacagga tctggagatg ggccgataaa gtcagtacgc aaaagaagag 1920

tacgaaagga ctaaactaga ttgaaatatt tttaattccc gagaggatgt ttggcattgt 1980 

aaaaatcagc atgccagacc tgaactttaa tcagtctgca catcctgttt ctaatatcta 2040 

gcaacattat attctttcag acatttattt tagtccttca tttccgagga aaaagaagca 2100 

actttgaagt tacctcatct ttgaatttag aataaaagtg gcacattaca tatcggatct 2160 

aagagattaa taccattaga agttacacag ttttagttgt ttggagatag ttttggtttg 2220 

tacagaacaa aataatatgt agcagcttca ttgctattgg aaaaatcagt tattggaatt 2280 

tccacttaaa tggctataca acaatataac tggtagttct ataataaaaa tgagcatatg 2340 

ttctgttgtg aagagctaaa tgcaataaag tttctgtatg gttgtttgat tctatcaaca 2400 

attgaaagtg ttgtatatga cccacattta cctagtttgt gtcaaattat agttacagtg 2460 

agttgtttgc ttaaattata gattccttta aggacatgcc ttgttcataa aatcactgga 2520 

ttatattgca gcatatttta catttgaata caaggataat gggttttatc aaaacaaaat 2580 



gatgtacaga ttttttttca agtttttata gttgctttat gccagagtgg tttaccccat 2640

tcacaaaatt tcttatgcat acattgctat tgaaaataaa atttaaatat tttttcatcc 2700 
tgaaaaaaaa 2710 



wx78h04.x1 NCI_CGAP_Ov38 Homo sapiens cDNA clone IMAGE:2549815 3', mRNA
sequence.

agcaatttga atcatttctt gaaaaacaaa cacagacaaa caccaaacat ggagttggtg 60 

cccggcgccg ggcataaggg cagcacccca cgggtggctg tgcggggggc cgctgggtgt 120 

ggccgggccc tgtgtgcctg tgcaggggcc cagctcctcg gggactggcc cacgaccccc 180 

cactcagcgg gctgagccaa tgcctggccg agagggggcc gcagccagca ggcttgggtg 240 

gctgcccgcg cccgcagggg acatcgggga aatgggggca gagtgcggga cccacacgct 300

gcctgaggag tcttggcagg gtggacaggc ctgggggtct ctaccagcaa tgcaataaat 360 

atgcaaatcc aagcacagaa agaccaagcg cagaccccac gggcgcacga ggcccagccc 420 

agttcctgcg ggcacgggca ccaccggctc ttcacagacc aggagt 466 



Human CD9 antigen mRNA, complete cds. 



CD9 antigen.



/translation="MPVKGGTKCIKYLLFGFNFIFWLAGIAVLAIGLWLRFDSQTKSIF
EQETNNNNSSFYTGVYILIGAGALMMLVGFLGCCGAVQESQCMLGLFFGFLLVIFAIEI
AAAIWGYSHKDEVIKEVQEFYKDTYNKLKTKDEPQRETLKAIHYALNCCGLAGGVEQFI
SDICPKKDVLETFTVKSCPDAIKEVFDNKFHIIGAVGIGIAVVMIFGMIFSMILCCAIR
RNREMV"



Sequence 1192 BP; 310 A; 243 C; 273 G; 366 T; 0 Other;
cgcgcccccc agtcccgcac ccgttcggcc caggctaagt tagccctcac catgccggtc 60
aaaggaggca ccaagtgcat caaatacctg ctgttcggat ttaacttcat cttctggctt 120
gccgggattg ctgtccttgc cattggacta tggctccgat tcgactctca gaccaagagc 180
atcttcgagc aagaaactaa taataataat tccagcttct acacaggagt ctatattctg 240
atcggagccg gcgccctcat gatgctggtg ggcttcctgg gctgctgcgg ggctgtgcag 300
gagtcccagt gcatgctggg actgttcttc ggcttcctct tggtgatatt cgccattgaa 360
atagctgcgg ccatctgggg atattcccac aaggatgagg tgattaagga agtccaggag 420
ttttacaagg acacctacaa caagctgaaa accaaggatg agccccagcg ggaaacgctg 480
aaagccatcc actatgcgtt gaactgctgt ggtttggctg ggggcgtgga acagtttatc 540
tcagacatct gccccaagaa ggacgtactc gaaaccttca ccgtgaagtc ctgtcctgat 600
gccatcaaag aggtcttcga caataaattc cacatcatcg gcgcagtggg catcggcatt 660
gccgtggtca tgatatttgg catgatcttc agtatgatct tgtgctgtgc tatccgcagg 720
aaccgcgaga tggtctagag tcagcttaca tccctgagca ggaaagttta cccatgaaga 780
ttggtgggat tttttgtttg tttgttttgt tttgtttgtt gtttgttgtt tgtttttttg 840
ccactaattt tagtattcat tctgcattgc tagataaaag ctgaagttac tttatgtttg 900
tcttttaatg cttcattcaa tattgacatt tgtagttgag cggggggttt ggtttgcttg 960
gtttatattt ttcagttgtt tgtttttgct tgttatatta agcagaaatc ctgcaatgaa 1020
aggtactata tttgctagac tctagacaag atattgtaca taaaagaatt tttttgtctt 1080
taaatagata caaatgtcta tcaactttaa tcaagttgta acttatattg aagacaattt 1140
gatacataat aaaaaattat gacaatgaaa aaaaaaaaaa aaaaaaaaaa gg 1192




Homo sapiens cDNA clone:HEMBA1001328, 3' end, expressed in whole embryo,
mainly head.

3'-end sequence (3'-EST); EST (expressed sequence tag); oligo capping.



gtagccttta tttacttaaa catttatttg cttctaggaa ataagcgctt tcctaatttc 60 

aagcaattat aaaagaactg ctgttttctt ccacactcac ttgccagagg gtcgaattgg 120

aagtcacata tatgtctatg aacggaagtt aaaagggaaa ttcaacatga agatgaaatt 180 

ctgaactttc ctagataaat taacattgct gggtggaaat attcagatgc tgcttaaata 240

cttcggtaaa cactgggtaa gattcatgga acttagaaaa aagctgtatg aactgcttta 300 

ccaaatatca ctactgagga aatgtataaa ataccacata gtataaaatt acatgttaat 360

ccaatgccag attttaaata aaggacctta agttttcctc aagggggaag tttaatgggt 420

cnttcccgnt ntcanagggc caaaaanttc ccaaggaaac caggtagnaa gctcttnaaa 480 

ggccgcaaaa t 491 



Homo sapiens 7-dehydrocholesterol reductase, mRNA (cDNA clone MGC:1760
IMAGE:3507516), complete cds.



/translation="MAAKSQPNIPKAKSLDGVTNDRTASQGQWGRAWEVDWFSLASVIF

LLLFAPF I VYYF IMACDQYS CALTGPWD I VTGHARLSD I WAKTPPITRKAAQLYTLWV 

TFQVIiLYTSLPDFCHKFLPGYVGGIQEGAVTPAGVVlJKYQINGIjQAWL 

IiLS WFS PTI IFDNW I PLLWCANT LGYAVSTFAMVKGYFFPTSARDCKFTGNFFYNYMMG 

IEFNPR IGKWFDFKLFFNGRPGI VAWTL INLS FAAKQRELHSHVTNAMVIjVNVLQAI YV 

IDFFWNETWYIjKTIDI CHDHFGWYLGWGD CVWLPYLYTLQGIiYIjVYHP VQLSTPHAVGV 

I^LGLVGYYIFRVANHQKDLFRRTDGRCLIWGRKPOT 

WGVARHFNYVGDLMGSLAYCLACGGGHLIjPYFYI I YMAI LLTHRCLRDEHRCAS KYGRD 
WERYTAAVP YRLLPG I F " 



gtggagcagc gcgcgcaagc gaggccaggg gaaggtgggc gcaggacttt agccggttga 60 
gaaggatcaa gcaggcattt ggagcacagg fcgtctagaaa cttttaaggg gccggttcaa 120 
gaaggaaaag ttcccttctg ctgtgaaact atctggcaag aggctggagg gcccaatggc 180 
tgcaaaatcg caacccaaca ttcccaaagc caagagtcta gatggcgtca ccaatgacag 240 
aaccgcatct caagggcagt ggggccgtgc ctgggaggtg gactggtttt cactggcgag 300

cgtcatcttc ctactgctgt tcgccccctt catcgtctac tacttcatca tggcttgtga 360

ccagtacagc tgcgccctga ccggccctgt ggtggacatc gtcactggac atgctcggct 420 
ctcggacatc tgggccaaga ctccacctat aacgaggaaa gccgcccagc tctatacctt 480

gtgggtcacc ttccaggtgc ttctgtacac gtctctccct gacttctgcc ataagtttct 540 
acccggctac gtaggaggca tccaggaggg ggccgtgact cctgcagggg ttgtgaacaa 600 
gtatcagatc aacggcctgc aagcctggct cctcacgcac ctgctctggt ttgcaaacgc 660 
tcatctcctg tcctggttct cgcccaccat catcttcgac aactggatcc cactgctgtg 720 
gtgcgccaac atccttggct atgccgtctc caccttcgcc atggtcaagg gctacttctt 780 
ccccaccagc gccagagact gcaaattcac aggcaatttc ttttacaact acatgatggg 840

catcgagttt aaccctcgga tcgggaagtg gtttgacttc aagctgttct tcaatgggcg 900 
ccccgggatc gtcgcctgga ccctcatcaa cctgtccttc gcagcgaagc agcgggagct 960 

ccacagccat gtgaccaatg ccatggtcct ggtcaacgtc ctgcaggcca tctacgtgat 1020 

tgacttcttc tggaacgaaa cctggtacct gaagaccatt gacatctgcc atgaccactt 1080 

cgggtggtac ctgggctggg gcgactgtgt ctggctgcct tatctttaca cgctgcaggg 1140 

tctgtacttg gtgtaccacc ccgtgcagct gtccaccccg cacgccgtgg gcgtcctgct 1200 

gctgggcctg gtgggctact acatcttccg ggtggccaac caccagaagg acctgttccg 1260

ccgcacggat gggcgctgcc tcatctgggg caggaagccc aaggtcatcg agtgctccta 1320 

cacatccgcc gacgggcaga ggcaccacag caagctgctg gtgtcgggct tctggggcgt 1380

ggcccgccac ttcaactacg tcggcgacct gatgggcagc ctggcctact gcctggcctg 1440

tggcggcggc cacctgctgc cctacttcta catcatctac atggccatcc tgctgaccca 1500

ccgctgcctc cgggacgagc accgctgcgc cagcaagtac ggccgggact gggagcgcta 1560 

caccgccgca gtgccttacc gcctgctgcc tggaatcttc taagggcacg ccctagggag 1620 

aagccctgtg gggctgtcaa gagcgtgttc tgccaggtcc atgggggctg gcatcccagc 1680 

tccaactcga ggagcctcag tttcctcatc tgtaaactgg agagagccca gcacttggca 1740 

ggtgtccagt acctaatcac gcttttgcct tcaagggaat tccgagtgtc cagcactgcc 1800 

gtattgccag cacagacgga ttttctctaa tcagtgtccc tggggcagga ggatgaccca 1860 

gtcaccttta ctagtccttt ggagacaatt tacctgtatt aggagcccag gccacgctac 1920 

actctgccca cactggtgag caggaggtct tcccacgccc tgtcattagg ctgcatttac 1980 

tcttgctaaa taaaagtggg agtggggcgt gcgcgttatc catgtattgc ctttcagctc 2040

tagatccccc tcccctgcct gctctgcagt tgtgggtggg gcccgtgcgc cgtttctcct 2100 

tggtagcgtg cacggtgttg aactgggaca ctggggagaa aggggctttc atgtcgtttc 2160 

cttcctgctc ctgctgcaca gctgccagga gtgctctgcc tggagtctgc agacctcaga 2220 

gaggtcccag cactggctgt ggcctttcag gtgtaggcag gtgggctctg cttcccgatt 2280 

ccctgtgagc gcccaccctc tcgaaagaat tttctgcttg ccctgtgact gtgcagactc 2340 

tggctcgagc aacccgggga acttcaccct caggggcctc ccacaccttc tccagcgagg 2400

aggtctcagt cccagcctcg ggagggcacc tccttttctg tgctttcttc cctgaggcat 2460

tcttcctcat ccctagggtg ttgtgtagaa ctctttttaa actctatgct ccgagtagag 2520 

ttcatcttta tattaaactt cccctgttca aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2580 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa 2614 



Homo sapiens squalene epoxidase (ERG1) mRNA, complete cds.

/translation="MWTFLGIATFTYFYKKFGDFITLANREVLLCVLVFLSLGLVLSYR

CRHRNGGLiIjGRQQSGS QFALFSD I LSGLPF I GFFWAKS P PE S ENKEQLEARRRRKGTNI 

SETSIiIGTAACTSTSSQNDPEVIIVGAGVLGSALA^ 

GEFLQPGGYHVIiKDLGLGDTVEGIjDAQVVN^^ 

AFHHGRF I MSLRKAAMAEPNAKF I EGWLQLIjEEDDWMGVQYKDKETGDIKELHAPIiT 

WADGLFSKFRKSLVSNKVSVSSHFVGFLMKN^ 

ETRVLVDIRGEMPRNIjREYMVEKIYPQIPDHL 

KRGVLLLGD AYNMRHPLTGGGMTVAFKD I KLWRKLLKG I PDL YDDAAI FEAKKS FYWAR 
KTSHSFVVNIIiAQALYELFSATDDSLHQLRKACFLYFKLGGEWAGPVGLLSVLSPNPL 
VlilGHFFAVAIYAVYFCFKSEPWITKPRAIjLSSGAVIiYKACSVIFPLIYSEMKYMVH" 

ctggtctgat cggacttctc gtcctgggac acagtttact ggagtctggc cggctctccg 60 

tgctcctctt ggtacctcat tttggggaga accttaaacc cactcgagca gataatctcc 120 

gccttgaccg gtgccaccaa agaagccttg gaaccatgtg gacttttctg ggcattgcca 180 

ctttcaccta tttttataag aagttcgggg acttcatcac tttggccaac agggaggtcc 240 

tgttgtgcgt gctggtgttc ctctcgctgg gcctggtgct ctcctaccgc tgtcgccacc 300

gaaacggggg tctcctcggg cgccagcaga gcggctccca gttcgccctc ttctcggata 360 

ttctctcagg cctgcctttc attggcttct tctgggccaa atccccccct gaatcagaaa 420 

ataaggagca gctcgaggcc aggaggcgca gaaaaggaac caatatttca gaaacaagct 480 

taataggaac agctgcctgt acatcaacat cttctcagaa tgacccagaa gttatcatcg 540 

tgggagctgg cgtgcttggc tctgctttgg cagctgtgct ttccagagat ggaagaaagg 600 

tgacagtcat tgagagagac ttaaaagagc ctgacagaat agttggagaa ttcctgcagc 660 

cgggtggtta tcatgttctc aaagaccttg gtcttggaga tacagtggaa ggtcttgatg 720 

cccaggttgt aaatggttac atgattcatg atcaggaaag caaatcagag gttcagattc 780 

cttaccctct gtcagaaaac aatcaagtgc agagtggaag agctttccat cacggaagat 840 

tcatcatgag tctccggaaa gcagctatgg cagagcccaa tgcaaagttt attgaaggtg 900 

ttgtgttaca gttattagag gaagatgatg ttgtgatggg agttcagtac aaggataaag 960 

agactggaga tatcaaggaa ctccatgctc cactgactgt tgttgcagat gggcttttct 1020 

ccaagttcag gaaaagcctg gtctccaata aagtttctgt atcatctcat tttgttggct 1080 

ttcttatgaa gaatgcacca cagtttaaag caaatcatgc tgaacttatt ttagctaacc 1140 

cgagtccagt tctcatctac cagatttcat ccagtgaaac tcgagtactt gttgacatta 1200 

gaggagaaat gccaaggaat ttaagagaat acatggttga aaaaatttac ccacaaatac 1260 

ctgatcacct gaaagaacca ttcttagaag ccactgacaa ttctcatctg aggtccatgc 1320 

cagcaagctt ccttcctcct tcatcagtga agaaacgagg tgttcttctt ttgggagacg 1380 

catataatat gaggcatcca cttactggtg gaggaatgac tgttgctttt aaagatataa 1440 

aactatggag aaaactgcta aagggtatcc ctgaccttta tgatgatgca gctattttcg 1500 

aggccaaaaa atcattttac tgggcaagaa aaacatctca ttcctttgtc gtgaatatcc 1560 

ttgctcaggc tctttatgaa ttattttctg ccacagatga ttccctgcat caactaagaa 1620 

aagcctgttt tctttatttc aaacttggtg gcgaatgtgt tgcgggtcct gttgggctgc 1680 

tttctgtatt gtctcctaac cctctagttt taattggaca cttctttgct gttgcaatct 1740 

atgccgtgta tttttgcttt aagtcagaac cttggattac aaaacctcga gcccttctca 1800 

gtagtggtgc tgtattgtac aaagcgtgtt ctgtaatatt tcctctaatt tactcagaaa 1860 

tgaagtatat ggttcattaa gcttaaaggg gaaccatttg tgaatgaata tttggaactt 1920 

accaagtcct aagagacttt tggaagagga tatatatagc atagtaccat accacttata 1980 

aagtggaaac tcttggacca agattaggat taatttgttt ttgaagtttt ttgtatataa 2040 

atatgtaaat acatgcttta atttgcaatt taaaatgaag gggttaaata agttagacat 2100 

ttaaaagaaa tgattgttac cataaattag tgctaatgct gaggagaact acagtttttc 2160 

ttttgaattt agtatttgag atgagttgtt gggacatgc 2199 



Homo sapiens keratin 23 (histone deacetylase inducible), transcript variant
1, mRNA (cDNA clone MGC:26158 IMAGE:4838347), complete cds.

/translation="MNSGHSFSQTPSASFHGAGGGWGRPRSFPRAPTVHGGAGGARISL
SFTTRSCPPPGGSWGSGRSSPLLGGNGKATMQNLNDRLASYVEKVRALEEANMKLESRI
LKWHQQRDPGSKKDYSQYEENITHLQEQIVDGKMTNAQIILLIDNARMAVDDFNLKYEN
EHSFKKDLEIEVEGLRRTLDNLTIVTTDLEQEVEGMRKELILMKKHHEQEMEKHHVPSD
FNVNVKVDTGPREDLIKVLEDMRQEYELIIKKKHRDLDTWYKEQSAAMSQEAASPATVQ
SRQGDIHELKRTFQALEIDLQTQYSTKSALENMLSETQSRYSCKLQDMQEIISHYEEEL
TQLRHELERQNNEYQVLLGIKTHLEKEITTYRRLLEGESEGTREESKSSMKVFATPKIK
AITQETINGRLVLCQVNEIQKHA"

agggggaaat cctgagcgca ggccagggtt gtttggtttt gaggtgtgct gggatgaaag 60 

gcaccctgga agtggaaggt aaatgagcaa tggaaaaact tcacggcaag attagaaaga 120 

tacctgagcc caatacccgc ctgatgtcgt gggccacacc tccgggttac caggggaagg 180 

gaggaagcaa actgtcatat tgatgtggct ctaaacaaca acagtgtgcg aaggcccagg 240 

ggcactttgg gattgaccaa gaggaaacac aagttgcaca atgatacaat cttgttggta 300 

caattgtcag agaagggaac tcccacagca aaggccataa aaccatccag ggcagtctgg 360 

ggcggctcag ttctgcggtg ccagggagtg gagcagagct cagccccgtc ccaaacacag 420 

atgggaccat gaactccgga cacagcttca gccagacccc ctcggcctcc ttccatggcg 480 

ccggaggtgg ctggggccgg cccaggagct tccccagggc tcccaccgtc catggcggtg 540 

cggggggagc ccgcatctcc ctgtccttca ccacgcggag ctgcccaccc cctggagggt 600 

cttggggttc tggaagaagc agccccctac taggcggaaa tgggaaggcc accatgcaga 660 

atctcaacga ccgcctggcc tcctacgtgg agaaggttcg cgccctggag gaggccaaca 720 

tgaagctgga aagccgcatc ctgaaatggc accagcagag agatcctggc agtaagaaag 780 

attattcaca gtatgaggaa aacatcacac acctgcagga gcagatagtg gatggtaaga 840 

tgaccaatgc tcagattatt cttctcattg acaatgccag gatggcagtg gatgacttca 900 

acctcaagta tgaaaatgaa cactccttta agaaagactt ggaaattgaa gtcgagggcc 960 

tccgaaggac cttagacaac ctgaccattg tcacaacaga cctagaacag gaggtggaag 1020 

gaatgaggaa agagctcatt ctcatgaaga agcaccatga gcaggaaatg gagaagcatc 1080 

atgtgccaag tgacttcaat gtcaatgtga aggtggatac gggtcccagg gaagatctga 1140 

ttaaggtcct ggaggatatg agacaagaat atgagcttat aataaagaag aagcatcgag 1200 

acttggacac ttggtataaa gaacagtctg cagccatgtc ccaggaggca gccagtccag 1260 

ccactgtgca gagcagacaa ggtgacatcc acgaactgaa gcgcacattc caggccctgg 1320

agattgacct gcagacacag tacagcacga aatctgcttt ggaaaacatg ttatccgaga 1380 

cccagtctcg gtactcctgc aagctccagg acatgcaaga gatcatctcc cactatgagg 1440 

aggaactgac gcagctacgc catgaactgg agcggcagaa caatgaatac caagtgctgc 1500 

tgggcatcaa aacccacctg gagaaggaaa tcaccacgta ccgacggctc ctggagggag 1560 

agagtgaagg gacacgggaa gaatcaaagt cgagcatgaa agtgtttgca actccaaaga 1620 

tcaaggccat aacccaggag accatcaacg gaagattagt tctttgtcaa gtgaatgaaa 1680 

tccaaaagca cgcatgagac caatgaaagt ttccgcctgt tgtaaaatct attttccccc 1740 

aaggaaagtc cttgcacaga caccagtgag tgagttctaa aagataccct tggaattatc 1800 

agactcagaa acttttattt tttttttctg taactgtctc accagacttc tcataatgct 1860 

cttaatatat tgcacttttc taatcaaagt gcgagtttat gagggtaaag ctctactttc 1920 

ctactgcagc cttcagattc tcatcatttt gcatctattt tgtagccaat aaaactccgc 1980 

actagcaaaa aaaaaaaaaa aa 2002



Homo sapiens translocon-associated protein gamma subunit mRNA, complete
cds.

/translation="MAPKGSSKQQSEEDLLLQDFSRNLSAKSSALFFGNAFIVSAIPIW
LYWRIWHMDLIQSAVLYSVMTLVSTYLVAFAYKNVKFVLKHKVAQKREDAVSKEVTRKL
SEADNRKMSRKEKDERILWKKNEVADYEATTFSIFYNNTLFLVVVIVASFFILKNFNPT
VNYILSISASSGLIALLSTGSK"



cctttgcccg cttggcggcc ggctctacgt tccctgttct cgcctgcagc tccgccatgg 60
ctcctaaagg cagctccaaa cagcagtctg aggaggacct gctcctgcag gatttcagcc 120
gcaatctctc ggccaagtcc tccgcgctct tcttcggaaa cgcgttcatc gtgtctgcca 180
tccccatctg gttatactgg cgaatatggc atatggatct tattcagtct gctgttttgt 240
atagtgtgat gaccctagta agcacatatt tggtagcctt tgcatacaag aatgtgaaat 300
ttgttctcaa gcacaaagta gcacagaaga gggaggatgc tgtttccaaa gaagtgactc 360
gaaaactttc tgaagctgat aatagaaaga tgtctcggaa ggagaaagat gaaagaatct 420
tgtggaagaa gaatgaagtt gctgattatg aagctacaac attttccatc ttctataaca 480
acactctgtt cctggtcgtg gtcattgttg cttccttctt catattgaag aacttcaacc 540 
ccacagtgaa ctacatattg tccataagtg cttcatcagg actcatcgcc ctcctgtcta 600 
ctggctccaa atagaccatg tcagcttcac cccctggctt tgtgtctatg ggtggcctgt 660 
ggtatatgga aaagtagcag ggtggtcagg gtgggagaca caagatgttt ttatagtcta 720 
gagcctttaa aaaacccagc agaatgtaat tcagtatttg tttattggct gttttttgac 780 
agattgttga aattaaatga attgaaaggg aaactcagag tactaggacg tttattaaaa 840 
ggaaaaaaat gtcttgcaat gtgctgtaat cacaagagga gaaaataact tgtttccttg 900 
atctgtcaga ggtcacagta acctgggccg agctgttatt atttattata taatagtagt 960 
aggaagttaa taactggttc tctgtgttcc aagcacaata ttacaacttc ttttgaaccg 1020 
taaatatcag aatgaatcct cttcccaggg gattgaacag aagcttaatg tttacaagtg 1080 
tttgaatttg tgatctgaaa taacacaaaa ttaaaaacat gatttctcta attttccaac 1140 
tagaggagag agacttgtgg aaaagttctt tttttctctt ttttttttct taaagaaggg 1200 
cagccaaggt agtaacctaa aaatagtgcc caggcatatg agagttgtcc tacgaggtta 1260
aagaacacac tgttccactg tatggctttg gcctgagtgg ccagggaggt caacttgacc 1320
ctgccatgtt ggtttgactt actaagacac aggaatcatt gttttccttg accagggtct 1380
cacaccctgg aggaatgtta agtaagagaa agaacctctt tcctgaatat tgacatgtaa 1440
aagaccaaag taatttttct gaacttctgc aattctgaga actctccaag gaatttacag 1500
tgattttagt gcttgtcagc atttttccat gaggactttc atacatttga ctctttagtt 1560
cacaggttcc cattgattgt gagcaagata tttatctctt tagcccttgg gatccagctg 1620
agagcaatct cttgcatttt tttacccgtg tatgtacaga tatcatttct tgtgtatgcc 1680
atgacttgaa aaagtttggg aagctcttta gcaatatcag ctaaaaggat atgaaatcac 1740
aggtgatagc agttgtcatt cagtaatttc ctacaagcag caccccaaag gaaatatagt 1800
cctaatcttt actatccact tctaaattta atgtgaattt catacatgtt attagttgtt 1860
ttctttataa ttttataaaa attattcatc gggagtttaa cttccacttc catgctatcg 1920
gatgtgttgg gctccatgca agaacttgga agaaaaacag gcaggaatgc atttgcataa 1980
tgacccagat catcattttc tgcaactgag aattatattt catcattgct tctagaagtc 2040
tgcaattctt tacttttctt tggtgcatta ttatctaggt gccatcactg gataatgtgg 2100
agtgactaga gaagtcatat atcactgtaa ggtacagtta gggtaacact ttagaggttt 2160
attattttta aaaaactttt cttgaactcc tggccaacat ggtgaaaccc cgtctctact 2220
aaaaatacca aaattagcca ggcgtgatgg tgggtgcctg taatctcagc tacttgggag 2280
gctgaagcag gagaactgcc tgaacccagg aggcagaggt tgcagtgagt cgagatcgtg 2340
ctactactgc ctgggtggca agggtgagac tccatctcaa aaaagaaaca aaaaaaccca 2400
aaaagttttc tttactgttg gttaaaaaaa aaagccagac catagtttga ctggtggcat 2460
ggaatttgtg tatcaaataa atgcatttgc ttatttgaca aaccatcagt gtccactatt 2520
tgttaccaga gttgggccac tatcttttaa aattgctggt gaaaacttgc cactagatgg 2580
agtgctgtat agatggggaa aaaattgcca ccattcttgg tataatacag tgtagcttag 2640
atgaggtggt gaaatagggg tatcagccga atattcctaa tatagtttct cttgaattaa 2700
taaactgaag atttgtagga aaatgagtga gcaaaatttg tttactgttg tgaatttttc 2760
ctacagcact gtttttaatc ttggtgtttt ccaactttct gtactaatag atacatttct 2820
gtgcataaga ttataaagca tatactcaca gttcagtagt tttcgttaag gatttactgt 2880
gtgagtactt tactgtgagg aattgcagaa ccttttcccc tctactcttg tctaaaagtt 2940



ctgtgtggca cacagagatg cgacctactc aatctgactt agtaaaacca tgctgtagaa 3000 
tttttgtctt aaaaagacca catacccagc acccatgaaa taaaagattc atctgtaaaa 3060 
a 3061



Homo sapiens malic enzyme 1, NADP(+)-dependent, cytosolic, mRNA (cDNA clone
MGC:39115 IMAGE:4870714), complete cds.

/translation="MEPEAPRRRHTHQRGYLLTRNPHLNKDLAFTLEERQQLNIHGLLP

PSFNSQEIQVLRWKNFEHLNSDFDRYLIiIj^ 

TVGLACQQYSLVFRKPRGIiFITIHDRGHIASV^ 

CNGMGIPVGKLALYTACGGMNPQECIjPVIIaDVGTENEELIiKDPLYIGIjRQRRVRGSEYD 
DFIiDEFMEAVS SKYGMNCL IQFEDFANVNAFRIiLNKYRNQ YCTFNDD I QGTAS VAVAGL 
LAALRITKNKLSDQTIIjFQGAGEAAIjGIAHIiIVMALEKEGLPKEKAIKKI^^ 
VKGRASLTQEKEKFAHEHEEMKNLEAIVQEIKPTALIGVAAIGGAFSEQILKD 
RPI I FAIiSNPTSKAECSAEQCYKITKGRAIFASGSPFDPVTIiPNGQTLYPGQGNNSYVF 
PGVAIjGVVACGIiRQITDNIFLTTAEVIAQQVSDKHLEEGRIiYPPIjNTIRDVSLKIAEK^ 
VKDAYQEKTATVYPEPQNKEAFVRSQMYSTDYDQIIiPDCYSWPEEVQKI QTKVDQ " 



gtcaccccag cagcatccgc cgcctgcacc gcgcgtgcgg cccgccccgg cctgaccccg 60
ccgccgaacc cggcgccagc catggagccc gaagcccccc gtcgccgcca cacccatcag 120
cgcggctacc tgctgacacg gaaccctcac ctcaacaagg acttggcctt taccctggaa 180
gagagacagc aattgaacat tcatggattg ttgccacctt ccttcaacag tcaggagatc 240
caggttctta gagtagtaaa aaatttcgag catctgaact ctgactttga caggtatctt 300
ctcttaatgg atctccaaga tagaaatgaa aaactctttt atagagtgct gacatctgac 360
attgagaaat tcatgcctat tgtttatact cccactgtgg gtctggcttg ccaacaatat 420
agtttggtgt ttcggaagcc aagaggtctc tttattacta tccacgatcg agggcatatt 480
gcttcagttc tcaatgcatg gccagaagat gtcatcaagg ccattgtggt gactgatgga 540
gagcgtattc ttggcttggg agaccttggc tgtaatggaa tgggcatccc tgtgggtaaa 600
ttggctctat atacagcttg cggagggatg aatcctcaag aatgtctgcc tgtcattctg 660
gatgtgggaa ccgaaaatga ggagttactt aaagatccac tctacattgg actacggcag 720
agaagagtaa gaggttctga atatgatgat tttttggacg aattcatgga ggcagtttct 780
tccaagtatg gcatgaattg ccttattcag tttgaagatt ttgccaatgt gaatgcattt 840
cgtctcctga acaagtatcg aaaccagtat tgcacattca atgatgatat tcaaggaaca 900
gcatctgttg cagttgcagg tctccttgca gctcttcgaa taaccaagaa caaactgtct 960
gatcaaacaa tactattcca aggagctgga gaggctgccc tagggattgc acacctgatt 1020
gtgatggcct tggaaaaaga aggtttacca aaagagaaag ccatcaaaaa gatatggctg 1080
gttgattcaa aaggattaat agttaaggga cgtgcttcct taacacaaga gaaagagaag 1140
tttgcccatg aacatgaaga aatgaagaac ctagaagcca ttgttcaaga aataaaacca 1200
actgccctca taggagttgc tgcaattggt ggtgcattct cagaacaaat tctcaaagat 1260
atggctgcct tcaatgaacg gcctattatt tttgctttga gtaatccaac tagcaaagca 1320
gaatgttctg cagagcagtg ctacaaaata accaagggac gtgcaatttt tgccagtggc 1380
agtccttttg atccagtcac tcttccaaat ggacagaccc tatatcctgg ccaaggcaac 1440
aattcctatg tgttccctgg agttgctctt ggtgttgtgg cgtgtggatt gaggcagatc 1500
acagataata ttttcctcac tactgctgag gttatagctc agcaagtgtc agataaacac 1560
ttggaagagg gtcggcttta tcctcctttg aataccatta gagatgtttc tctgaaaatt 1620
gcagaaaaga ttgtgaaaga tgcataccaa gaaaagacag ccacagttta tcctgaaccg 1680
caaaacaaag aagcatttgt ccgctcccag atgtatagta ctgattatga ccagattcta 1740
cctgattgtt attcttggcc tgaagaggtg cagaaaatac agaccaaagt tgaccagtag 1800
gataatagca aacatttcta actctattaa tgaggtcttt aaacctttca taatttttaa 1860
aggttggaat cttttataat gattcataag acacttagat taagatttta ctttaacagt 1920
ctaaaaattg atagaagaat atcgatataa attgggataa acatcacatg agacaaaaaa 1980
aaaaaaaaaa aa




Ctcccctccc agggtgtctg gtggcaggcc tgtgcctatc cctgctgtcc 60
ccagggtggg ccccgggggt caggagctcc agaagggcca gctgggcata ttctgagatt 120
llJlllill r CCatttCt 9Ct9caaacc tggtcagagc cag?g?tccc tcca?Sgac 180
Itlltf^. 9 t 9 cca agtgc ctgcaccgtg gaccacagcc gagccactgg gcagclggtg 240
atggtcccac gcaggagcgc tgtggacccc gctctctggg cagccctg?! ItaigcSS 300
acacctgcag agcctgggac cacgtggatg ggcagatcct gggccagctg cggcccctX 360
cagaggagga agaggaggag ggcgccgggg ccaccttgtc cagggggccf gfcttccccg 420
gcatgggctc tgaggagttg cgtctggcct ccttctatga ctS?cgctg actgctgagg 480
gctgctggct gctgccggct tcttccacac aggLaLai gacaagSga 540
SSS ctgctatggg ggcctgcaga gctggaagcg cggggacga? ?cctgglcgg 600
gt " ttcccc agctgtcagt tcctgctccg gtcaaaagga agaglcttg 660
tccacagtgt gcaggagaot cactcccagc tgctgggctc ctgggacccg tgggaagaac 720
cggaagacgc agcccctgtg gccccctccg tccctgcctc tglgLccc? gSctgccca 780
ZattlT* 9 a9a " tcca9 tctgaaagtg cccaggagcc aglgccagg gSgtggagg 840
cSSSt? ?2f Ct9 r 9 9a " a9a " a ^gcaaggt gtgcctggal cgcgccgtg 900
t tgtgccgtgc ggccacctgg tctgtgctga gtgtgccccc ggcctgcagc 960
22ES Ct9Ca9agcc <=ccgtccgca gccgcgtgcg caccttcctg Icctaggcla 1020
aaafSS?? C '" CCa99t MSotgeag. gtgggotccc tgcccctctc tgcctgltct 1080
522 ^ 9CCt9C t9a " at " c Wtsgtg tccatccagc actgaccagc 1140
cctgattccc cgaccaccgc ccagggtgga gaaggaggcc cttgcttggc gtgggggatg 1200
gcttaactgt acctgtttgg atgcttctga atagaaataa agtjggtttt Sc55w? 1260



Homo sapiens drebrin 1, transcript variant 1, mRNA (cDNA clone MGC:1517
IMAGE:3356428), complete cds.



/translation="MAGVSFSGHRLELLAAYEEVIREESAADWALYTYEDGSDDLKLAA
SGEGGLQELSGHFENQKVMYGFCSVKDSQAALPKYVLINWVGED^ 

VAEFFQGVDVI VNAS S VED IDAGAIGQRL SNGIiARL S S PVLHRIiRLREDENAE PVGTTY 

QKTDAAVEMKRINREQFWEQAKKEEELRKEEERKKALDERIjRFEQE 

RRYREREQQIEEHRRKQQTLEAEEAKRRLKEQS I FGDHRDEEEETHMKKS E S EVEEAAA 

IIAQRPDNPREFFKQQERVASASAGSCDVPSPFNHRPGSHLDSHRRMAPTPIP 

SSTASTPVAEQIERAIjDEVTSSQPPPLPPPPPPAQETQEPSPIIjDSEETRAAT^PQAWAG 

PMEEPPQAQAPPRGPGSPAEDLMFMESAEQAVIiAAPVEPATT^^ 

TADTTVANNVPPAATSLIDLWPGNGEGASTLQ^ 

EPLiIiPAGEGCATLIjNFDELPEPPATFCDPEEVEGEPIiAT^PQTPTIiPSALEEIiEQEQEPE 
PHLLTNGETTQKEGTQASEGYFS QS QEEEFAQS EELCAKAPPPVFYNKPPE ID I TCWDA 
DPVPEEEEGFEGGD " 

ccgaggcggc ggcggcgact ccctctttcc ctccctcctc ctccgtccgc ccgtccgtcc 60 

gcgcgtctgt ccgttcggcc cggtccggcc cgaagcatgg ccggcgtcag cttcagcggc 120 

caccgcctgg agctgctggc ggcttacgag gaggtgatcc gagaggagag cgcggccgac 180 

tgggctctgt acacatatga agatggctcc gatgacctca agcttgcagc atcaggagaa 240

gggggcttgc aggagctttc gggacacttt gagaaccaga aggtgatgta cggcttctgc 300 

agtgtcaagg actcccaagc tgctctgcca aaatacgtgc tcatcaactg ggtgggcgaa 360 

gatgtgcctg atgcccgcaa gtgcgcttgt gccagccacg tggctaaggt ggcagagttc 420 

ttccagggtg tcgacgtgat cgtgaacgcc agcagcgtgg aagacataga cgcgggtgcc 480 

atcgggcagc ggctctctaa cgggctggcg cgactctcca gccctgtgct gcaccgactg 540 

cggctgcgag aggatgagaa cgcagagccc gtgggcacca cctaccagaa gacggatgca 600 

gctgtggaaa tgaagcggat taaccgagag cagttctggg agcaggccaa gaaggaagaa 660 

gagctgcgga aggaggagga gcggaagaag gccctggatg agaggctcag gttcgagcag 720 

gagcggatgg agcaggagcg gcaggagcaa gaggagcgcg agcggcgcta ccgggagcgg 780 

gagcagcaga tcgaggagca caggaggaaa cagcagactt tagaagcgga agaggccaag 840 

aggcggttga aggagcagtc tatctttggt gaccatcggg atgaggagga agagacccac 900 

atgaagaagt cagagtcgga ggtggaggag gcagcagcta ttattgccca gcggcctgac 960 

aacccaaggg agttcttcaa gcagcaggaa agagtcgcat cggcctctgc gggcagctgt 1020 

gatgtaccct cgcccttcaa ccatcgacca ggcagccacc tggacagcca ccggaggatg 1080 

gcgcccactc ccatccccac gcggagcccg tctgactcca gcaccgcctc cacccctgtc 1140 

gctgagcaga tagagcgggc cctggatgag gtcacctcct cgcagcctcc accactgcca 1200 

ccgccacccc caccagccca agagacccag gagcccagcc ccatcctaga cagtgaggag 1260 

accagagcag cagcccctca ggcctgggcc ggccccatgg aggagccccc tcaggcacag 1320 

gcgcctcccc gggggccagg cagccctgca gaggacttga tgttcatgga gtctgcagag 1380 

caggctgtcc tggctgctcc cgtggagcct gccacagctg acgccacgga ggtccacgat 1440

gcagctgaca ccattgaaac tgacactgcc actgctgaca ccactgttgc caacaacgta 1500 

ccccccgccg ccaccagcct cattgaccta tggcctggca acggggaagg ggcctccaca 1560 

ctccagggtg agcccagggc ccccacgcca ccctcgggta ctgaggtcac cctggcagag 1620

gtgcccctgc tggatgaggt ggctccggag ccactgctgc cagcaggcga aggctgtgcc 1680 

acccttctca actttgatga gctgcctgag ccgccagcca ccttctgtga cccagaggaa 1740

gtggaagggg agcccctggc tgccccccag accccaactc tgccctcagc ccttgaggag 1800 

ctggagcaag agcaggagcc ggagccccac ctgctaacca atggcgagac cacccagaag 1860 

gaggggaccc aggccagtga ggggtacttc agtcaatcac aggaggagga gtttgcccaa 1920

tcggaagagc tctgtgccaa ggctccgcct cctgtgttct acaacaagcc tccagagatc 1980 

gacatcacat gctgggatgc agacccagtt ccagaagagg aggagggctt cgagggtggt 2040 

gattagcggt ggcgccagcc ctaggctacc cttgccaagg ccgcccacct gcatcagcct 2100 

ctggccagac ggcccgccgt gcctgcattc gcagcagctc cgcctggcac ccactccgga 2160 

ttccggccct ggctggggac ttggccgctt ccctacccac agggcctgac ttttacagct 2220 

tttctctttt tttaaaaagt tgataggaga cttgtacagt tgactggctt tcctctcgtt 2280 

ggtagttgag acgctgttgc aaattccacc cctccttccc tggtccagat tgtagctctt 2340 

agtcctccct gctcagctgg ccgggttgga ggcctcaccc tgcttggggc ctggcgtggg 2400 

gggagctctg gtgggaaaat gtcccccacc tcttttccta gttttatgtt tcttgggaaa 2460 



atatcacttt gtattctctg tccagggctt cagatatttt gcacgaattt taaaacatgg 2520 
caataaatgg ctcgtgggct ctggcaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2580 
aaaaaaaaaa aaa 2593



Homo sapiens MDS019 (MDS019) mRNA, complete cds. 

/ translation " MKPHPMWVEIiMyRDTFS VNFYNRPIIjSRBNTVWLCYEVIcrKGPS 
RPPLDAKIFRGQVYSEI»KYHPEMRFFHWFSKWRKIjHRDQKYBVTWYISWSPCTKCTRDM 
ATFLAEDPKVTLT I FVARLYYFWDPDYQEAIjRSLCQKRDGPRATMKIMNYDEFQHC WS K 
FVYSQRELFEPWNNLPKyYILiaiMI^EILRHSMDPPTFTFNFMNEPWVRGRHETYLCY 

EVBR^Q^lroTWVllIlNQIlRGFIlCNQAPHKHGFIJEGRHAEIlCFlJDVI PFWKLDliDQDYRVTC 
FTSWSPCFSCAQEMAKFISKNKHVSLCIFTARIYDDQGRCQEGLRTIAEAGAKISIMTY 

SEFKHCWDTFVDHQGCPFQPWDGIiDEHSQDLSGRLRAILQNQEN" 



ctpccaqggg gagggcccca gagaaaacca gaaagagggt gagagactga ggaagataaa 60
gcgtcccagg gcctcctaca ccagcgcctg agcaggaagc gggaggggcc atgactacga 120
Lccctggga ggtcacttta gggagggctg tcctaaaacc agaagcttgg agcagaaagt 180
lLaccSgg tgctccagac aaagatctta gtcgggacta gccggccaag gatgaagcct 240
lacttcagaa acacagtgga gcgaatgtat cgagacacat tctcctacaa atttUM 300
agacccatcc tttctcgtcg gaataccgtc tggctgtgct acgaagtgaa aacaaagggt 360
ccctcaaggc cccctttgga cgcaaagatc tttcgaggcc aggtgtattc cgaacttaag 420
taccacccag agatgagatt cttccactgg ttcagcaagt ggaggaagct gcatcgtgac 480
caggagtatg aggtcacctg gtacatatcc tggagcccct gcacaaagtg tacaagggat 540
atggccacgl tcctggccgl ?gacccgaag gttaccctga ccatcttcgt tgcccgcctc 600
tactacttct gggacccaga ttaccaggag gcgcttcgca gcctgtgtca gaaaagagac 660
ggtccgcgtg ccaccatgaa gatcatgaat tatgacgaat ttcagcactg ttggagcaag 720
??cgtgtaca gccaaagaga gctatttgag ccttggaata atctgcctaa atattatata 780
ttactgcaca tcatgctggg ggagattctc agacactcga tggatccacc «cattcact 840
ttcaacttta acaatgaacc ttgggtcaga ggacggcatg agacttacct gtgttatgag 900
gtggagcgca tgcacaatga cacctgggtc ctgctgaacc agcgcagggg ctttctatgc 960
aaccaggctc cacataaaca cggtttcctt gaaggccgcc atgcagagct gtgcttcctg 1020
gacgtgattc ccttttggaa gctggacctg gaccaggact acagggttac ctgcttcacc 1080
Lw» cctgcttcag ctgtgcccag gaaatggcta aattcatttc aaaaaacaaa 1140
cacgtgagcc tgtgcatctt cactgcccgc atctatgatg atcaaggaag atgtcaggag 1200
gggctgcgca ccctggccga ggctggggcc aaaatttcaa taatgacata cagtgaattt 1260
aagcactgct gggacacctt tgtggaccac cagggatgtc ccttccagcc ctgggatgga 1320
ctlgatgagc acagccaaga cctgagtggg aggctgcggg ccattctcca gaatcaggaa 1380
aactgaagga tgggcctcag tctctaagga aggcagagac ctgggttgag cctcagaata 1440
aaagatcttc ttccaagaaa tgcaaacagg ctgttcacca ccatctccag ctgatcacag 1500
acaccagcaa agcaatgcac tcctgaccaa gtagattctt ttaaaaatta ^gtgcatta 1560
ctttgaatca aaaatttatt tatatttcaa gaataaagta ctaagattgt gctcaataca 1620
cagaaaagtt tcaaacctac taatccagcg acaatttgaa tcggttttgt aggtagagga 1680
ataaaatgaa atactaaatc tttctgtaaa aaaaaaa 1717



Human carnitine palmitoyltransferase I mRNA, nuclear gene encoding
mitochondrial protein, complete cds.






/translation="MAEAHQAVAFQFTVTPDGVDFRLSREALKHVYLSGINSWKKRLIR

• 1 I ^ GILRC ^PGSPTSWLVVIMATVGSSFCaTVDISLGLVSCIQRCLPQGCGPYQTPQTR 

ALLS MAI FSTGVWVTG I PFFRQTLKLLLCYHGVWFEMHGKTSNLTRIWAMC IRLLS SRH 
PMLYSPQTSLPKLPVPRVSATIQRYIiESVRPLLDDEEYYRMELLAKEFQDKTAPRLQKY 
LVLKSWWASNYVSDWWEEYIYIiRGRSPLMVNSNYyVMDLVLIKNTDVQAARLGNI I HAM 
IMYPJUOiDP^EIKPVMALGIVPMCSYQMERMFimTilPGKDTDVLQHLSDSRHVAVYHK 
GRFFKLVJLYEGARLLKPQDLEMQFQRILDDPSPPQPGEEKLAALTAGGRVEWAQARQAF 
FSSGKNKAALEAIERAAFFVALDEESYSYDPEDEASLSLYGKALLHGNCYNRWFDKSFT 
LISFKNGQLGLNAEHAWADAPIIGHLWEFVLGTDSFHLGYTETGHCLGKPNPALAPPTR 
LQWDIPKQCQAVIKSSYQVAKALADDVELYCFQFLPFGKGLIKKCRTSPDAFVQIALQL 
AHFRDRGKFCLTYEASMTRMFREGRTETVRSCTSESTAFVQAMMEGSHTKADLRDLFQK 
AAKKHQNMYPJJAMTGAGIDRHLFCLYLVSKYLGVSSPFLAEVIjSEPWRLSTSQIPQSQI 

RMFDPEQHPNHLGAGGGFGPVADDGYGVSYMIAGENTIFFHISSKFSSSETNAQRFGNH 
' T IRKALLDIADLFQVPKAYS " 

ccgcgcaccc atctgccccc gtcctaggtg ccgaccaacc cccaggatgg cggaagctca 60 

ccaggccgtg gccttccagt tcacggtgac cccagacggg gtcgacttcc ggctcagtcg 120 

ggaggccctg aaacacgtct acctgtctgg gatcaactcc tggaagaaac gcctgatccg 180 

catcaagaat ggcatcctca ggggcgtgta ccctggcagc cccaccagct ggctggtcgt 240 
catcatggca acagtgggtt cctccttctg caacgtggac atctccttgg ggctggtcag 300
ttgcatccag agatgcctcc ctcaggggtg tggcccctac cagaccccgc agacccgggc 360
acttctcagc atggccatct tctccacggg cgtctgggtg acgggcatct tcttcttccg 420

ccaaaccctg aagctgcttc tctgctacca tgggtggatg tttgagatgc atggcaagac 480 

cagcaacttg accaggatct gggctatgtg tatccgcctt ctatccagcc ggcaccctat 540 

gctctacagc ttccagacat ctctgcccaa gcttcctgtg cccagggtgt cagccacaat 600 

tcagcggtac ctagagtctg tgcgcccctt gttggatgat gaggaatatt accgcatgga 660 

gttgctggcc aaagaattcc aggacaagac tgcccccagg ctgcagaaat acctggtgct 720 

caagtcatgg tgggcaagta actatgtgag tgactggtgg gaagagtaca tctaccttcg 780 

aggcaggagc cctctcatgg tgaacagcaa ctattatgtc atggaccttg tgctcatcaa 840 

gaatacagac gtgcaggcag cccgcctggg aaacatcatc cacgccatga tcatgtatcg 900 

ccgtaaactg gaccgtgaag aaatcaagcc tgtgatggca ctgggcatag tgcctatgtg 960 

ctcctaccag atggagagga tgttcaacac cactcggatc ccgggcaagg acacagatgt 1020

1 11^. 9C t° ^° tca 9 aca gc«=3gcaog t ggctgtctac cacaagggac gcttcttcaa 1080 

gctgtggctc tatgagggcg cccgtctgct caagcctcag gatctggaga tgcagttcca 1140

gaggatcctg gacgacccct ccccacctca gcctggggag gagaagctgg cagccctcac 1200 

tgcaggagga agggtggagt gggcgcaggc acgccaggcc ttctttagct ctggaaagaa 1260 

taaggctgcc ttggaggcca tcgagcgtgc cgctttcttc gtggccctgg atgaggaatc 1320 

ctactcctat gaccccgaag atgaggccag cctcagcctc tatggcaagg ccctgctaca 1380 

tggcaactgc tacaacaggt ggtttgacaa atccttcact ctcatttcct tcaagaatgg 1440

ccagttgggt ctcaatgcag agcatgcgtg ggcagatgct cccatcattg ggcacctctg 1500 

ggagtttgtc ctgggcacag acagcttcca cctgggctac acggagaccg ggcactgcct 1560
gggcaaaccg aaccctgcgc tcgcacctcc tacacggctg cagtgggaca ttccaaaaca 1620
gtgccaggcg gtcatcaaga gttcctacca ggtggccaag gcgttggcag acgacgtgga 1680
gttgtactgc ttccagttcc tgccctttgg caaaggcctc atcaagaagt gccggaccag 1740
ccctgatgcc tttgtgcaga tcgcgctgca gctggctcac ttccgggaca ggggtaagtt 1800

ctgcctgacc tatgaggcct caatgaccag aatgttccgg gagggacgga ctgagactgt 1860

gcgttcctgt accagcgagt ccacagcctt tgtgcaggcc atgatggagg ggtcccacac 1920 

aaaagcagac ctgcgagatc tcttccagaa ggctgctaag aagcaccaga atatgtaccg 1980 

cctggccatg accggggcag ggatcgacag gcacctcttc tgcctttact tggtctccaa 2040 

gtacctagga gtcagctctc ctttccttgc tgaggtgctc tcggaaccct ggcgtctctc 2100 

caccagccag atcccccaat cccagatccg catgttcgac ccagagcagc accccaatca 2160 

cctgggcgct ggaggtggct ttggccctgt agcagatgat ggctatggag tttcctacat 2220 

gattgcaggc gagaacacga tcttcttcca catctccagc aagttctcaa gctcagagac 2280 

gaacgcccag cgctttggaa accacatccg caaagccctg ctggacattg ctgatctttt 2340 



ccaagttccc aaggcctaca gctgaagccc ttaggtacct Stgttttgtt tgggaactcg 2400

gaggccctcc ccctccccca gctcagacca cagaggtggc aagagaaggg ctgaagctgg 2460

aagactgttc atgagggact tgtgtgacct gctttgaaat gtgtgactct gctgagtgac 2520

gtaggctctg agltagctgt ccacgcccac gtgtttgctt ggaataaata cttgcc 2576



Homo sapiens prostate differentiation factor mRNA, complete cds. 

/translation="MPGQELRTLNGSQMLLVLLVLSWLPHGGALSLAEASRASFPGPSE
LHSEDSRFRELRKRYEDLLTRLRANQSWEDSNTDLVPAPAVRILTPEVRLGSGGHLHLR
ISRAALPEGLPEASRLHRALFRLSPTASRSWDVTRPLRRQLSLARPQAPALHLRLSPPP
SQSDQLLAESSSARPQLELHLRPQAARGRRRARARNGDHCPLGPGRCCRLHTVRASLED
LGWADWVLSPREVQVTMCIGACPSQFRAANMHAQIKTSLHRLKPDTVPAPCCVPASYNP
MVLIQKTDTGVSLQTYDDLLAKDCHCI"



agcgtttaaa cttaagcttg gagttatttc caccatgccc gggcaagaac tcaggacgct 60
gaatggctct cagatgctcc tggtgttgct ggtgctctcg tggctgccgc atgggggcgc 120
ZSESZ ^ C9a " C3a Scogcgc-a tttcccggga cc?tcagagt tgcac?ccga 180
agactccaga ttccgagagt tgcggaaacg ctacgaggac ctgctaacca ggctgcgggc 240
caaccagagc tgggaagatt cgaacaccga cctcgtcccg gcccctgcag tccggatact 300
cacgccagaa gtgcggctgg gatccggcgg ccacctgcac ctgcgtatc? ctcSgccgc 360
ccttcctgag gggctccccg aggcctcccg ccttcaccgg gctctgttcc ggctgtcccc 420
gacggcgtca aggtcgtggg acgtgacacg accgctgcgg cgtcagctca gccttgcaag 480
accccaggcg cccgcgctgc acctgcgact gtcgccgccg ccgtcgcagt cggaccaact 540
T^taZ? tCttC9tcc 9 cacggcccca gctggagttg dtticgje cgaagccgc 600
cgcagagcgc gtgcgcgcaa cggggaccac tgtccgctcg ggcccgggcg 660
SSSS ctgcacacgg tccgcgcgtc gctggaagac ctgggctggg ccgat?igg? 720
SZZEZ y 81698 aa 9tgaccat gtgcatcggc gcgtgcccga gccagtlccg 780
ggcggcaaac atgcacgcgc agatcaagac gagcctgcac cgcctgaagc ccgacacgg? 840
S tgctgcgtgc ccgccagcta caatcccatg g?gctcat?c aaLgaccga 900
JIS2!2 tC9ctcca 9 a cctatgatga cttgttagcc aaagactgcc actgcatatg 960
aactagtact aagccgaatt ctgcagatat cc 992



Homo sapiens amphiphysin II mRNA, complete cds.



/translation="MAEMGSKGVTAGKIASNVQKKLTRAQEKVLQKLGKADETKDEQFE
QCVQNFNKQLTEGTRLQKDLRTYLASVKAMHEASKKLNECLQEVYEPDWPGRDEANKIA
ENNDLLWMDYHQKLVDQALLTMDTYLGQFPDIKSRIAKRGRKLVDYDSARHHYESLQTA
KKKDEAKIAKPVSLLEKAAPQWCQGKLQAHLVAQTNLLRNQAEEELIKAQKVFEEMNVD
LQEELPSLWNSRVGFYVNTFQSIAGLEENFHKEMSKLNQNLNDVLVGLEKQHGSNTFTV
KAQPSDNAPAKGNKSPSPPDGSPAATPEIRVNHEPEPAGGATPGATLPKSPSQLRKGPP
VPPPPKHTPSKEVKQEQILSLFEDTFVPEISVTTPSQFEAPGPFSEQASLLDLDFDPLP
PVTSPVKAPTPSGQSIPWDLWEPTESPAGSLPSGEPSAAEGTFAVSWPSQTAEPGPAQP
AEASEVAGGTQPAAGPQEPGETAASEAASSSLPAVVVETFPATVNGTVEGGSGAGRLDL
PPGFMFKVQAQHDYTATDTDELQLKAGDVVLVIPFQNPEEQDEGWLMGVKESDWNQHKE
LEKCRGVFPENFTERVP"



ccgggcgagg cctgcgccgc gatggcagag atgggcagta aaggggtgac ggcgggaaag 
atcgccagca acgtgcagaa gaagctcacc cgcgcgcagg agaaggttct ccagaagctg 
gggaaggcag atgagaccaa ggatgagcag tttgagcagt gcgtccagaa tttcaacaag 
cagctgacgg agggcacccg gctgcagaag gatctccgga cctacctggc ctccgtcaaa 
gccatgcacg aggcttccaa gaagctgaat gagtgtctgc aggaggtgta tgagcccgat 
tggcccggca gggatgaggc aaacaagatc gcagagaaca acgacctgct gtggatggat 
taccaccaga agctggtgga ccaggcgctg ctgaccatgg acacgtacct gggccagttc 
cccgacatca agtcacgcat tgccaagcgg gggcgcaagc tggtggacta cgacagtgcc 
cggcaccact acgagtccct tcaaactgcc aaaaagaagg atgaagccaa aattgccaag 
cctgtctcgc tgcttgagaa agccgccccc cagtggtgcc aaggcaaact gcaggctcat 
ctcgtagctc aaactaacct gctccgaaat caggccgagg aggagctcat caaagcccag 
aaggtgtttg aggagatgaa tgtggatctg caggaggagc tgccgtccct gtggaacagc 
cgcgtaggtt tctacgtcaa cacgttccag agcatcgcgg gcctggagga aaacttccac 
aaggagatga gcaagctcaa ccagaacctc aatgatgtgc tggtcggcct ggagaagcaa 
cacgggagca acaccttcac ggtcaaggcc cagcccagtg acaacgcgcc tgcaaaaggg 
aacaagagcc cttcgcctcc agatggctcc cctgccgcca cccccgagat cagagtcaac 
cacgagccag agccggccgg cggggccacg cccggggcca ccctccccaa gtccccatct 
cagctccgga aaggcccacc agtccctccg cctcccaaac acaccccgtc caaggaagtc 
aagcaggagc agatcctcag cctgtttgag gacacgtttg tccctgagat cagcgtgacc 
accccctccc agtttgaggc cccggggcct ttctcggagc aggccagtct gctggacctg 
gactttgacc ccctcccgcc cgtgacgagc cctgtgaagg cacccacgcc ctctggtcag 
tcaattccat gggacctctg ggagcccaca gagagtccag ccggcagcct gccttccggg 
gagcccagcg ctgccgaggg cacctttgct gtgtcctggc ccagccagac ggccgagccg 
gggcctgccc aaccagcaga ggcctcggag gtggcgggtg ggacccaacc tgcggctgga 
ccccaggagc caggggagac ggcggcaagt gaagcagcct ccagctctct tcctgctgtc 
gtggtggaga ccttcccagc aactgtgaat ggcaccgtgg agggcggcag tggggccggg 
cgcttggacc tgcccccagg tttcatgttc aaggtacagg cccagcacga ctacacggcc 
actgacacag acgagctgca gctcaaggct ggtgatgtgg tgctggtgat ccccttccag 
aaccctgaag agcaggatga aggctggctc atgggcgtga aggagagcga ctggaaccag 
cacaaggagc tggagaagtg ccgtggcgtc ttccccgaga acttcactga gagggtccca 
tgacggcggg gcccaggcag cctccgggcg tgtgaagaac acctcctccc gaaaaatgtg 
tggttctttt ttttgttttg ttttcgtttt tcatcttttg aagagcaaag ggaaatcaag 
aggagacccc caggcagagg ggcgttctcc caaagattag gtcgttttcc aaagagccgc 
gtcccggcaa gtccggcg 



DE 602149641F1 NIH_MGC_81 Homo sapiens cDNA clone IMAGE:4290707 5', mRNA
DE sequence.

aat 9^ gat ^ tagttgacgc tccccaaagt gcatgagaca catgctaaaa 60 

ttacaaatta aaatttgggt cagacttgcc ataatgatag actcaattta gctctctgaa 120

toaSSoJ STT" tttaattCCC -ttgggcgt tggtgtacat LaaatgLa 180

tgagaagtgt gtatgctgac caaaccacaa gaaactttct ttaagtggtg ttaaagagga 240

"?!=" a !^ "ccaagcgg tgttacattg aaaatggtaa cagagcagcg tgcttccacc loo 

nitwi t a 9 a ^gtggga accacagcag aagttataga gcgacaactt atatacacac 360 

ctagaaggta agttaaacaa aataccggct tccagagacc ccttttctcc agccatatta 420

gaagtaatta aggtggattt atttcatcta caagcagttg gtccctaagt 480 

Itt^lttl, 9"f aaaaac aaaacggacc aaacagtgtg gggaaaaatt tccatgtgct 540 

tctgtgaagc ttatgtggta cacgtgggcc atttctaatc ttctctgggg ggagcggcca 600 

cagacctgtg ttcgggtgaa cctcttaatt cctgagtctt taccaataci gttcoSggJ 660 

gcgcggggaa cgccttggat atgccaggtc agaaaggggg ctcgatatgg gttgcccag? 720 

tcctggggca ccttgctttg aaacacccct catttttgc 759 




Human global transcription activator homologous sequence mRNA, complete 
cds. 

/translation="MDPEYEEKMKADRAKRFEFLLKQTELFAHFIQPSAQKSPTSPLNM
KLGRPRIKKDEKQSLISAGDYRHRRTEQEEDEELLSESRKTSNVCIRFEVSPSYVKGGP
LRDYQIRGLNWLISLYENGVNGILADEMGLGKTLQTIALLGYLKHYRNIPGPHMVLVPK
STLHNWMNEFKRWVPSLRVICFVGDKDARAAFIRDEMMPGEWDVCVTSYEMVIKEKSVF
KKFHWRYLVIDEAHRIKNEKSKLSEIVREFKSTNRLLLTGTPLQNNLHELWALLNFLLP
DVFNSADDFDSWFDTKNCLGDQKLVERLHAVLKPFLLRRIKTDVEKSLPPKKEIKIYLG
LSKMQREWYTKILMKDIDVLNSSGKMDKMRLLNILMQLRKCCNHPYLFDGAEPGPPYTT
DEHIVSNSGKMVVLDKLLAKLKEQGSRVLIFSQMTRLLDILEDYCMWRGYEYCRLDGQT
PHEEREDKFLEVEFLGQREAIEAFNAPNSSKFIFMLSTRAGGLGINLASADVVILYDSD
WNPQVDLQAMDRAHRIGQKKPVRVFRLITDNTVEERIVERAEIKLRLDSIVIQQGRLID
QRSNKLAKEEMLQMIRHGATHVFASKESELTDEDITTILERGEKKTAEMNERLQKMGES
SLRNFRMDIEQSLYKFEGEDYREKQKLGMVEWIEPPKRERKANYAVDAYFREALRVSEP
KIPKAPRPPKQPNVQDFQFFPPRLFELLEKEILYYRKTIGYKVPRNPDIPNPALAQREE
QKKIDGAEPLTPEETEEKEKLLTQGFTNWTKRDFNQFIKANEKYGRDDIDNIAREVEGK
SPEEVMEYSAVFWERCNELQDIEKIMAQIERGEARIQRRISIKKALDAKIARYKAPFHQ
LRIQYGTSKGKNYTEEEDRFLICMLHKMGFDRENVYEELRQCVRNAPQFRFDWFIKSRT
AMEFQRRCNTLISLIEKENMEIEERERAEKKKRATKTPMVKFSAFS"

caagactgga agcagagaga gagagcaaga gtgagagaga gcgagcgagc gtagtcagga 
gatggtgttg tattccagga gaaaaacgtt tcttcatttc aactcaaact tgctgctaaa 
gcgcctaaat ctgaaaagga aatggaccca gaatatgaag agaaaatgaa agccgaccga 
gcaaagagat ttgaattttt actgaagcag acagaacttt ttgcacattt cattcagcct 
tcagcacaga aatctccaac atctccactg aacatgaaat tgggacgtcc ccgaataaag 
aaagatgaaa agcagagctt aatttctgct ggagactacc gccataggcg cacagagcaa 
gaagaagatg aagagctact gtctgagagt cggaaaacat ctaatgtgtg tattagattt 
gaggtgtcac cttcatatgt gaaagggggg ccactgagag attatcagat tcgaggactg 
aattggttga tctctttata tgaaaatgga gtcaatggca ttttggctga tgaaatgggc 
cttgggaaaa ctttacaaac aattgctttg cttggttacc tgaaacacta ccgaaatatt 
cctggacctc acatggtttt agttccaaag tctactttac acaactggat gaatgaattt 
aaacgatggg tcccatctct ccgtgtcatt tgttttgtcg gagacaagga tgccagagct 
gcttttattc gtgatgaaat gatgccagga gagtgggatg tttgcgttac ttcttatgag 
atggtaatta aagaaaaatc tgtattcaaa aagtttcact ggcgatacct ggtcattgat 
gaagctcaca gaataaagaa tgaaaaatct aagctttcag agattgttcg tgagttcaag 
tcgactaacc gcttgctcct aactggaaca cctttgcaga ataacctgca tgaactgtgg 
gccttactca actttttatt gcctgatgtc tttaattctg cagatgactt tgattcttgg 
tttgacacta aaaattgtct tggtgatcaa aaactcgtgg aaagacttca tgcagtttta 
aaaccatttt tgttacgccg tataaaaact gatgtagaga agagtctgcc acctaaaaag 
gaaataaaga tttacttggg gctgagtaag atgcaacgag aatggtatac aaaaatcctg 
atgaaagata ttgatgtttt aaactcttct ggcaagatgg acaagatgcg actcttaaac 
attctgatgc agcttcgaaa gtgttgtaat catccatatc tgtttgatgg tgctgaacct 
ggtccacctt ataccactga tgagcatatt gtcagcaaca gtggtaaaat ggtagttctg 
gataaactat tggccaaact caaagaacag ggttcaaggg ttctcatttt cagccagatg 
actcgcttgc tggatatttt ggaagattat tgcatgtggc gtggttatga gtattgtcga 
ctggatggac aaaccccgca tgaagaaaga gaggataaat tcctagaagt ggaatttctg 
ggtcaaaggg aagcaataga ggcttttaat gctcctaata gtagcaaatt catctttatg 
ctaagtacca gggctggagg tctcggaatt aacctggcaa gtgctgatgt ggttatacta 
tatgattcag actggaaccc acaggttgat ctacaagcta tggatcgagc acatcgtatt 
ggtcagaaga aaccagtacg tgtattccgt ctcatcactg acaacactgt tgaagagagg 
attgtagaaa gagctgagat aaaactgaga ctcgattcaa ttgttataca acaaggaaga 
ctcattgacc aacggtctaa caagctggca aaagaggaaa tgttacaaat gatacggcat
ggagccaccc atgtttttgc ttctaaagag agtgagttga cagatgaaga cattacaact 
attctggaaa gaggggaaaa gaagactgca gagatgaatg aacgcctgca aaaaatggga 
gagtcttctc taagaaattt tagaatggac attgaacaaa gtttatacaa atttgaggga 
gaagattata gagaaaaaca gaagcttggc atggtggaat ggattgaacc tcctaaacga 



gaacgcaaag caaactacgc agtggatgcc tactttagag aggctttgcg tgtcagcgag 2220

ccaaagattc caaaggctcc acggcctcca aaacagccaa atgttcagga ttttcaattt 2280 

ttcccaccac gcttatttga gctcctggaa aaggaaattc tttattatcg gaagacaata 2340 

ggctataagg ttccaaggaa tcctgatatc ccaaatccag ctctggctca aagagaagag 2400 

caaaaaaaga ttgatggagc tgaacctctt acaccagaag agactgaaga aaaggaaaaa 2460 

cttctcacac aaggtttcac aaactggact aaacgagatt ttaaccagtt tattaaagct 2520 

aatgagaaat atggaagaga tgacattgat aacatagctc gagaggtaga gggcaaatcc 2580 

cctgaggagg tcatggagta ttcagctgta ttttgggaac gttgcaatga attacaggac 2640

attgagaaaa ttatggctca aattgaacgt ggagaagcaa gaattcaacg aaggatcagt 2700 

atcaagaaag ccctggatgc caaaattgca agatacaagg ctccatttca tcagttgcgc 2760 

attcagtatg gaaccagcaa aggaaagaac tatactgagg aagaagatag attcttgatt 2820

tgtatgttac acaaaatggg ctttgataga gaaaatgtat atgaagaatt aagacagtgt 2880

gtacgaaatg ctccccagtt tagatttgac tggtttatca agtctaggac tgccatggaa 2940 

ttccagagac gctgtaacac tctgatttca ttgattgaga aagaaaatat ggaaattgag 3000

gaaagagaga gagcagaaaa gaagaaacgg gcaactaaaa ctccaatggt aaaattttca 3060

gcattttcct aacttttaga tttaacattg ttgggccatt taaaatgtgc atattggagc 3120 

agaacattaa atctgtttcc attttagtca cagaaaagaa aagcagagtc agctactgag 3180 

agctctggaa agaaggatgt caagaaggtg aaatcctaaa gcctagaaat aaagttttaa 3240 

atgggaaact gctattttct tgttcccatc ttcaaatgct aattgccagt tccagtgtat 3300 

tcatggtact ctaagaaaaa tctctttggt tttgatttct tgcatatttt atatatttta 3360 

caatgctttc tacctgaaat gtgtagcttt atattttatg gcattctagt atttttgtgt 3420 

actgtatttt gtgcatttca tgtcttcatc aaaatcctct cagtccttgt tcttttgaag 3480 

cttgtgctga ggttttagct tttctatgtt ttatatgccg ctgctttgaa agagaaccta 3540

gattctatag ttgtattatt gttgtttcat actttaaatt tatatggctg tggaaaaacg 3600

aattaaaatg ttttgaggag aaagaaaaaa aaaa 3634




tbGOaOl.x1 NCI_CGAP_Brl5 Homo sapiens cDNA clone IMAGE:2058696 3' similar
to gb:M84739 CALRETICULIN PRECURSOR (HUMAN);, mRNA sequence.

tatacggctg cgagaagacg acagaagggg acagaggcaa gaaaagatgt tgatcaagaa 
agatgagaac caggggtgag ggctgaagga gaatcaaaga taaaatacca gtttaaaaaa 
aaaaaaaaaa aaaaaaaagt cgtatcga 



DE tu04d02.x1 NCI_CGAP_Pr28 Homo sapiens cDNA clone IMAGE:2250051 3', mRNA
DE sequence.



tttttacaag ggggaaaatt atgtatttat ttacacaaat atgcacagaa cacttgtatc 60

tttcaaaagt cacacttaag acatagtaaa agcatgttgt atgaaccatg tattcttaag 120

gattgagcaa actgcaggct gcttgctgcc ttttaggttt gctagtccct gatctacttg 180 

aaacagatgt tgcttgcccc aacactagtt taattataag ggcagcctgt gagaaagttt 240 

caatagacat ttttctcacc tatattgcac gtttttctga agcccttggg caagtgtgtg 300 

tgccatgtgt agttctattt acatataaac gctactttaa aagtttatca aaatcatgag 360 

tttttacaaa agtttttaat gctcttctgc attatatgta gcattgcaaa tctgcaaagt 420 

agtaaaacta taaagcacct ttaggtttgc accagttatt acagaaatgg ggatttgtga 480 

aaaggatgta atttgatgta gaagggcaaa gtcctttaat gactggcatt caagaggatt 540
acttaaaaca 550



Homo sapiens mRNA for KIAA0895 protein, partial cds. 

/translation="SSYGLVEALLTVSVGTSSSALTLLILLQVAESRLPPRPRHRGPGQ
RLEAARPCAALSPEPAEPAAPGLGATMLESIRVTEKLHWPEQELAKKSILNAEDSLIID
NKRSISHLSSGVLKDIFTTGTSSYNVLLQSKEKKKYHSQKQSSSTYSKRCRKPSKSPNT
SRSKDPRRMKALVPVTSSGTWYCLERRPAVFVTSSVSSPVKFTHDISVTGNGIVLPPKP
KSKVKWCHFSTLPKPKPQLSRSFEKGDDFSGKKFCILTAIKPTNLEKEKLRFFKSDYTY
NPQFEYANPALPSVLAKHSHASDRFLKQIVVHLTEDLLSRASMTVVNGCPTLTINVSTA
REHWLEGMLRHEIGTHYFRGINNLQQPWNSWTGRKKHELKPNNPTEEGLASIHSVLFRK
DPFLWRAALLYYTVYQASQMSFCELFKDIGRFVKDPNTRWDYCVRAKRGWTDTSQPGCF
SKDQVYLDGILQILRYRDTIDFHLLTALGKVSYEDVDRLKGLAVTENMRVPHFLQDHGR
YMEHLEKIMEVNELTDRELKDLI"

ccagtagcta cgggctggtg gaagcgctgc tcacagtcag tgtcggtacg tcttcgtccg 60 

cccttaccct ccttatcctg ctccaggtag cggagagccg cctcccgccg cgcccccgcc 120 

accgaggccc gggtcagagg ctagaggccg cccgcccctg cgcggccctt tcccctgagc 180 

ccgcggagcc cgccgccccg ggcctcgggg cgacgatgct ggagtccatt cgcgtgacgg 240 

aaaagcttca ctggcctgag caagaacttg ctaagaagtc tattctaaat gcagaagatt 300 

cattgatcat tgacaacaaa agaagcattt cacatttgtc ctcgggagtg ctaaaagaca 360 

ttttcacaac tggaaccagt agttacaatg tcctactaca gagcaaggag aaaaaaaagt 420 

atcattcaca aaaacagtct tcctccacct actccaaaag atgtagaaaa cccagcaaat 480 
ctcctaacac ttctcgtagc aaagatcctc gcaggatgaa agccctggtg cctgtgacaa 540 
gcagtggtac ttggtactgc ctggagaggc ggcctgctgt ttttgtcact agttcagtgt 600 
caagtcctgt aaagtttaca catgatatct ctgttacagg gaatggcata gtactgccac 660 
ctaaacccaa aagcaaggtc aagtggtgcc atttctccac tcttccaaag ccaaagcctc 720 
agctgtctag aagctttgaa aagggagatg acttttctgg gaagaaattt tgtatattga 780 
ctgctataaa acccaccaac ttagagaaag aaaaactgag attcttcaaa tctgactata 840 
cctacaatcc tcagtttgag tatgccaatc ctgctctgcc aagcgtatta gctaagcata 900 
gccacgcatc tgaccgattt cttaagcaga ttgtagttca tctcactgag gacctgcttt 960 

cccgagcgtc aatgacagta gtaaatggat gtccgactct gactatcaat gtgtccactg 1020 

cacgtgagca ttggctggag ggaatgctga ggcatgaaat aggtacacat tattttcgag 1080 

gtattaacaa cctccagcag ccatggaaca gttggactgg acgtaaaaaa catgagctaa 1140 

agccaaataa tcccacagag gaaggactag caagcattca cagtgttctg tttagaaaag 1200 

accctttttt atggagggct gccctcctct actacactgt fctatcaagcc agccaaatgt 1260 

ctttttgtga actctttaaa gatattggca ggtttgtcaa ggaccccaat acaagatggg 1320 

attattgtgt acgggccaag aggggatgga ctgatacttc ccaaccaggg tgttttagta 1380 

aagaccaggt atacttggat ggaattcttc aaatcctccg atacagagat accatagact 1440 

tccatctgct gactgctctc gggaaggttt cttatgaaga tgtggatcgc ttaaaaggat 1500 

tggcagttac cgaaaacatg agggtccctc atttcctgca ggaccatggc cgatatatgg 1560 

aacacttaga gaagatcatg gaagtgaatg aactgactga cagggaactg aaagatctta 1620 

tatagtaatt agcgttctgg caaacatagc taagctatgc ctccatgtat attaccagtt 1680 

aggtgcagtt agcaccagaa gatttataaa agaagaaaga ctacttgtgt tttctgaaga 1740 

agggtcttca gtattcagcg gaatttttag gttaagtaca gatcttaaac tatttcccta 1800 

aaatgtttct ataggctgca gggggaagtt attcctattt tctgaatctc gacagagtca 1860 

gatgaaaata ctactgctga gcatttttga agactcttgg tcaaattgca tgataaattt 1920 

ctgcctgagc agtaagcact ggcctagtgc ttctgcctaa atatggaggt cagctccaac 1980 

tggagactgg ctgaattcca ttgctgttca gactccaaag ttatatttta tttgataaat 2040 

aatggtaatt attctccttt gaaaaattag ttttgtgttg ctccaaaaag ctagctatat 2100 

atctcagctt tcttatttct tttatatgtt gagggatttt aaagggagag gaaaagcaat 2160 

ttgaaaattt ttcattaagt gttttatttt aacattattg tttcacctta tcagcttata 2220 

ggaactaaat tagagaatca cctctttttg tccggttatc tcttaactat gttttcattt 2280 

gctgaaacat attgtaccct tttaaatttt ttacagagtt ttaacgtctt ttccactgca 2340 
tcctttataa aaataagtac aaattccaga gaggttttcc tacacaaata caaaataatt 2400 
ggggaagggc tactcacctt ttttatgaaa cagaatatgt aacaagcaga ggacaaatag 2460 
actgacctgg ttaaacgtag gttctggatg ttgatgggta gcacctgcag taaacttgac 2520 
catgctctga aaaaagaaat ttctgggtat tgatggttta taaagccagc tttgtgctat 2580 
tggttcagta tattttatca aatctttgac ttcatcaacc cagtaatgac atagtttaaa 2640 



ttttaaaatg agagttctgt tttgctgtat ttctcctgct agcctagctt tgtattttat 2700 

catcagcttc agtatatact tgtatatata tacctggatt caaatggttc ttcatacaac 2760 

ttaatgatca gaaagaatgc tgctgtaaac accccactca agattgctgc tggaatttct 2820 

acccaaagct gtctgagagt ttatctttac ctcagtgcag gcttaaataa tggtgctaat 2880 

actggtagat atatactgtg aagcctggtt cacattcatg ctagaaaatg attttgggaa 2940 

atcttttata ggagaggtta catgattttt ctctttcaca tatctagaag gacttgcctc 3000

aaagaagaat gttgcagcaa acaggatata ctctagctgg tgcccaaagg ttcttgcaaa 3060 

accatgaatc ttgcttttag cagtaaaggg ccagacactt tggagcattg aaaggcctta 3120 

gcccctgctc cccaactacg tgccatgctg ggtagtgtgt atcctgtaca tctgtgtgcc 3180 

aggctggggc agactgtgcc aatgctcacc aaacactaga atctgctctt acacctctag 3240 

catgtatctc gttgtagcag agttgtgcac tttctcagca ttgtgtagtg ttttctaatt 3300 

gcatctaaaa acttatcaaa agtgattgtg aaaacagtgt cttagaagta taaacagaaa 3360 

tggcaatatt tatgtcctgt gattcaagcc caaaggttat aaattcaact ttcacaggga 3420 

atagcactgc taatcttact ttatgattta aatataaagg aaaatcacag cagccttaat 3480 

ttcctgttgg tcggatcatt tgcagcagtt ctagttcctg acttttaaaa tggatgaaat 3540

tttctctttg tctgatttgg aaaaggaact gcttttttgc cttactgctt tggtataagg 3600 

atgaaaaaca tgagcactct gcagacaaaa tgaccttaaa tcacattgat taagatattt 3660 

taaaagttag cagtgaacca aaagtagttt cagattagca gaaataaaga gctttaagtt 3720 

ttaaaagttg agattgaata tttaatgaaa gtttattaat tctttttcca ggaatagcag 3780 

taaggtcagt ttttttccct aaaataaaaa gttttaataa acagaaaatt atagcaacag 3840 

tacttaggag aatagttgaa gattgtatta attttaatta taatagttat ttgtcatagg 3900 

ctattgatga ttagaatttc attagttttg tccactataa ttttaaaaat agttgtgtca 3960 

aatacaaatt ctggtaaact gttaggtttt aacaccagac agtattcaaa ggaaattaat 4020 

gtttgcaaac ataaatccat agtcttcatt tctttatatt gtcacctttg taaaagtgtt 4080 

taaaatttgt attgtttgtt ttgtatatct ttgggcatct tgtgtctagc tataataaaa 4140 

agaaacggtg ccaag 4155



Homo sapiens NUCB2 protein (NUCB2) mRNA, complete cds. 

/translation="MRWRTILLQYCFLLITCLLTALEAVPIDIDKTKVQNIHPVESAKI
EPPDTGLYYDEYLKQVIDVLETDKHFREKLQKADIEEIKSGRLSKELDLVSHHVRTKLD
ELKRQEVGRLRMLIKAKLDSLQDIGMDHQALLKQFDHLNHLNPDKFESTDLDMLIKAAT
SDLEHYDKTRHEEFKKYEMMKEHERREYLKTLNEEKRKEEESKFEEMKKKHENHPKVNH
PGSKDQLKEVWEETDGLDPNDFDPKTFFKLHDVNSDGFLDEQELEALFTKELEKVYDPK
NEEDDMVEMEEERLRMREHVMNEVDTNKDRLVTLEEFLKATEKKEFLEPDSWETLDQQQ
FFTEEELKEYENIIALQENELKKKADELQKQKEELQRQHDQLEAQKLEYHQVIQQMEQK
KLQGIPPSGPAGELKFEPHI"



caggtttgtg cgctggacgc aagcaccagg cgcagcctcg ctcgccgaga cccggccaga 60 

acgtgttacg agtcagtttt tagtgaaaaa acattgagct aggagccaag acccatctct 120 

tcactatttt ggtattgtgc aagtcatctt acctctctgg atctcagttg tctcatctgt 180 

aaaaaggaga taaaaattat ttacctgcct gaacatgagg tggaggacca tcctgctaca 240 

gtattgcttt ctcttgatta catgtttact tactgctctt gaagctgtgc ctattgacat 300 

agacaagaca aaagtacaaa atattcaccc tgtggaaagt gcgaagatag aaccaccaga 360 

tactggactt tattatgatg aatatctcaa gcaagtgatt gatgtgctgg aaacagataa 420 

acacttcaga gaaaagctcc agaaagcaga catagaggaa ataaagagtg ggaggctaag 480 

caaagaactg gatttagtaa gtcaccatgt gaggacaaaa cttgatgaac tgaaaaggca 540 

agaagtagga aggttaagaa tgttaattaa agctaagttg gattcccttc aagatatagg 600 

catggaccac caagctcttc taaaacaatt tgatcaccta aaccacctga atcctgacaa 660 

gtttgaatcc acagatttag atatgctaat caaagcggca acaagtgatc tggaacacta 720 

tgacaagact cgtcatgaag aatttaaaaa atatgaaatg atgaaggaac atgaaaggag 780 

agaatattta aaaacattga atgaagaaaa gagaaaagaa gaagagtcta aatttgaaga 840 

aatgaagaaa aagcatgaaa atcaccctaa agttaatcac ccaggaagca aagatcaact 900 

aaaagaggta tgggaagaga ctgatggatt ggatcctaat gactttgacc ccaagacatt 960 

tttcaaatta catgatgtca atagtgatgg attcctggat gaacaagaat tagaagccct 1020 

atttactaaa gagttggaga aagtatatga ccctaaaaat gaagaggatg atatggtaga 1080 

aatggaagaa gaaaggctta gaatgaggga acatgtaatg aatgaggttg atactaacaa 1140 

agacagattg gtgactctgg aggagttttt gaaagccaca gaaaaaaaag aattcttgga 1200 

gccagatagc tgggagacat tagatcagca acagttcttc acagaggaag aactaaaaga 1260 

atatgaaaat attattgctt tacaagaaaa tgaacttaag aagaaggcag atgagcttca 1320 

gaaacaaaaa gaagagctac aacgtcagca tgatcaactg gaggctcaga agctggaata 1380 

tcatcaggtc atacagcaga tggaacaaaa aaaattacaa ggaattcctc catcagggcc 1440 

agctggagaa ttgaagtttg agccacacat ttaaagtctg aagtccacca gaacttggaa 1500 
gaaa 



Homo sapiens glucose-6-phosphate dehydrogenase, mRNA (cDNA clone MGC:8534
IMAGE:2822640), complete cds.



/translation="MAEQVALSRTQVCGILREELFQGDAFHQSDTHIFIIMGASGDLAK
KKIYPTIWWLFRDGLLPENTFIVGYARSRLTVADIRKQSEPFFKATPEEKLKLEDFFAR
NSYVAGQYDDAASYQRLNSHMNALHLGSQANRLFYLALPPTVYEAVTKNIHESCMSQIG
WNRIIVEKPFGRDLQSSDRLSNHISSLFREDQIYRIDHYLGKEMVQNLMVLRFANRIFG
PIWNRDNIACVILTFKEPFGTEGRGGYFDEFGIIRDVMQNHLLQMLCLVAMEKPASTNS
DDVRDEKVKVLKCISEVQANNVVLGQYVGNPDGEGEATKGYLDDPTVPRGSTTATFAAV
VLYVENERWDGVPFILRCGKALNERKAEVRLQFHDVAGDIFHQQCKRNELVIRVQPNEA
VYTKMMTKKPGMFFNPEESELDLTYGNRYKNVKLPDAYERLILDVFCGSQMHFVRSDEL
REAWRIFTPLLHQIELEKPKPIPYIYGSRGPTEADELMKRVGFQYEGTYKWVNPHKL"



cacttcgggg ctgcgagcgc ggagggcgac gacgacgaag cgcagacagc gtcatggcag 60 

agcaggtggc cctgagccgg acccaggtgt gcgggatcct gcgggaagag cttttccagg 120

gcgatgcctt ccatcagtcg gatacacaca tattcatcat catgggtgca tcgggtgacc 180 

tggccaagaa gaagatctac cccaccatct ggtggctgtt ccgggatggc cttctgcccg 240

aaaacacctt catcgtgggc tatgcccgtt cccgcctcac agtggctgac atccgcaaac 300 

agagtgagcc cttcttcaag gccaccccag aggagaagct caagctggag gacttctttg 360 

cccgcaactc ctatgtggct ggccagtacg atgatgcagc ctcctaccag cgcctcaaca 420 

gccacatgaa tgccctccac ctggggtcac aggccaaccg cctcttctac ctggccttgc 480

ccccgaccgt ctacgaggcc gtcaccaaga acattcacga gtcctgcatg agccagatag 540 

gctggaaccg catcatcgtg gagaagccct tcgggaggga cctgcagagc tctgaccggc 600 

tgtccaacca catctcctcc ctgttccgtg aggaccagat ctaccgcatc gaccactacc 660 

tgggcaagga gatggtgcag aacctcatgg tgctgagatt tgccaacagg atcttcggcc 720 

ccatctggaa ccgggacaac atcgcctgcg ttatcctcac cttcaaggag ccctttggca 780 

ctgagggtcg cgggggctat ttcgatgaat ttgggatcat ccgggacgtg atgcagaacc 840 

acctactgca gatgctgtgt ctggtggcca tggagaagcc cgcctccacc aactcagatg 900 

acgtccgtga tgagaaggtc aaggtgttga aatgcatctc agaggtgcag gccaacaatg 960 

tggtcctggg ccagtacgtg gggaaccccg atggagaggg cgaggccacc aaagggtacc 1020

tggacgaccc cacggtgccc cgcgggtcca ccaccgccac ttttgcagcc gtcgtcctct 1080 

atgtggagaa tgagaggtgg gatggggtgc ccttcatcct gcgctgcggc aaggccctga 1140 

acgagcgcaa ggccgaggtg aggctgcagt tccatgatgt ggccggcgac atcttccacc 1200 

agcagtgcaa gcgcaacgag ctggtgatcc gcgtgcagcc caacgaggcc gtgtacacca 1260 

agatgatgac caagaagccg ggcatgttct tcaaccccga ggagtcggag ctggacctga 1320 

cctacggcaa cagatacaag aacgtgaagc tccctgacgc ctatgagcgc ctcatcctgg 1380

acgtcttctg cgggagccag atgcacttcg tgcgcagcga cgagctccgt gaggcctggc 1440 

gtattttcac cccactgctg caccagattg agctggagaa gcccaagccc atcccctata 1500 

tttatggcag ccgaggcccc acggaggcag acgagctgat gaagagagtg ggtttccagt 1560 

atgagggcac ctacaagtgg gtgaaccccc acaagctctg agccctgggc acccacctcc 1620 

acccccgcca cggccaccct ccttcccgcc gcccgacccc gagtcgggag gactccggga 1680 

ccattgacct cagctgcaca ttcctggccc cgggctctgg ccaccctggc ccgcccctcg 1740 

ctgctgctac tacccgagcc cagctacatt cctcagctgc caagcactcg agaccatcct 1800 

ggcccctcca gaccctgcct gagcccagga gctgagtcac ctcctccact cactccagcc 1860 

caacagaagg aaggaggagg gcgcccattc gtctgtccca gagcttattg gccactgggt 1920 

ctcactcctg agtggggcca gggtgggagg gagggacgag ggggaggaaa ggggcgagca 1980 

cccacgtgag agaatctgcc tgtggccttg cccgccagcc tcagtgccac ttgacattcc 2040 

ttgtcaccag caacatctcg agccccctgg atgtcccctg tcccaccaac tctgcactcc 2100 

atggccaccc cgtgccaccc gtaggcagcc tctctgctat aagaaaagca gacgcagcag 2160 

ctgggacccc tcccaacctc aatgccctgc cattaaatcc gcaaacagcc aaaaaaaaaa 2220

aaaaaaaaaa 2230



Homo sapiens zinc finger protein 165 (Zpf165) mRNA, complete cds.



/translation="MATEPKKAAAQNSPEDEGLLIVKIEEEEFIHGQDTCLQRSELLKQ
ELCRQLFRQFCYQDSPGPREALSRLRELCCQWLKPEIHTKEQILELLVLEQFLTILPGD
LQAWVHEHYPESGEEAVTILEDLERGTDEAVLQVQAHEHGQEIFQKKVSPPGPALNVKL
QPVETKAHFDSSEPQLLWDCDNESENSRSMPKLEIFEKIESQRIISGRISGYISEASGE
SQDICKSAGRVKRQWEKESGESQRLSSAQDEGFGKILTHKNTVRGEIISHDGCERRLNL
NSNEFTHQKSCKHGTCDQSFKWNSDFINHQIIYAGEKNHQYGKSFKSPKLAKHAAVFSG
DKTHQCNECGKAFRHSSKLARHQRIHTGERCYECNECGKSFAESSDLTRHRRIHTGERP
FGCKECGRAFNLNSHLIRHQRIHTREKPYECSECGKTFRVSSHLIRHFRIHTGEKPYEC
SECGRAFSQSSNLSQHQRIHMRENLLM"

Sequence 2150 BP; 690 A; 459 C; 523 G; 469 T; 9 Other; 

ggtcccggat ccgcgcgggt ttggggatcc anatgtccag ccccgtgtcc ccctccaaac 60 

atccagtccc tctcatattg cctttgaaat tagcagcctc tgggtgacca gaccttggcc 120

ctcagaggaa tcccgganaa aggtanaacc agcttcgcgt tgggaacgca ggcgcgctta 180 

cgcatttagt gagggtttgg cggtctccat anttaccgcc gccgcgcgtg acntcatant 240 

ggagcgctga gggcttggtg gcgtggggtg ggggctgtcc tactgatcct gaatttgggt 300 

cactggtaan angagttgcc cattccancc aggtggaacg gggaggggta gccacatgtc 360 

tcagatctgc cattgtctgc gaaaagaaac tgctgcgagg accatcccca atcccctgct 420 

tcccttggga agagtaaccg ccgttttgta ggacacttgg ggacaacccc gcttgtcctg 480 

aaatttattg acacggtaaa tagtatttcc tgtgtgccga ggatgcagtt aaaccaacac 540 

tgaccccctg cccttgagaa acacaagatg gctacagaac caaagaaagc tgcagcccag 600 

aactctccag aggatgaagg acttctgata gtgaagatag aagaggaaga atttatccat 660 

gggcaggaca cttgcttaca gagaagtgaa ctccttaagc aggagctctg caggcagctt 720 

tttaggcagt tctgctacca ggattctcct ggacctcgcg aggcactgag ccgcctccgg 780 

gagctctgct gtcagtggct gaagccagag atccatacca aggaacagat tctggaactg 840 

ctggtgctag agcagttcct gaccatcctg ccaggagatt tgcaggcctg ggtacatgaa 900 

cattacccag agagtggaga ggaggcagtg accatactag aagatttgga gagaggcact 960 

gatgaagcag tactccaggt tcaagcccat gaacatggac aagaaatatt ccagaaaaaa 1020 

gtgtcacctc ctggaccagc acttaatgtc aagttacagc cagtggagac caaggcccat 1080 

tttgattcat cagaacccca gctcctatgg gactgtgata atgagagtga aaacagtaga 1140 

tccatgccaa agctggaaat ttttgaaaaa attgaatcac agagaattat atctggaaga 1200 

atctcaggat acatatcaga agcatctggt gagtctcaag acatctgtaa gtctgcaggc 1260 

agggtaaaga gacaatggga aaaagaatca ggggagtctc agagactctc gtctgcccag 1320 

gatgaaggtt ttggtaaaat cctcacccac aaaaatacag tcagaggtga aataataagc 1380

cacgatggat gtgagaggag attaaatctg aactcaaatg aattcacaca ccagaaatct 1440 

tgtaaacatg gtacctgtga ccagagcttc aaatggaact cagattttat taaccatcaa 1500 

ataatttatg ctggagaaaa aaatcaccaa tatggaaaat ctttcaagag cccaaaactt 1560 

gctaaacatg cagcagtttt cagtggagat aaaactcatc agtgtaatga atgtgggaaa 1620 

gctttcaggc acagctcaaa acttgctagg catcagagaa tccacactgg agagagatgc 1680 

tatgaatgta atgaatgtgg gaaaagcttt gcagagagct cagatcttac tagacatcgg 1740 

cgaattcaca ctggggaaag accctttggt tgcaaagaat gtgggagagc attcaacctg 1800 

aactcacatc ttatcaggca tcagagaatt cacaccagag agaaacccta cgagtgtagt 1860 

gaatgtggga aaaccttccg agtgagctca catcttattc gacactttag aattcacact 1920 

ggagaaaaac cctatgaatg cagtgagtgt ggaagagcct tcagtcagag ctcaaacctt 1980 

agtcaacacc agagaattca catgagggaa aacctattaa tgtaaggaac ttaaatttgt 2040

aagtaaatgc tgaggaaatg gcacaatatg aaaaatatta aataaaaaat aaatattggg 2100 

caagtggaag actgaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2150 



DE 602326096F1 NIH_MGC_90 Homo sapiens cDNA clone IMAGE:4414319 5', mRNA
DE sequence.
XX
KW EST.
XX
OS Homo sapiens (human)
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Primates; Catarrhini; Hominidae; Homo.
XX
RN [1]
RP 1-887
RA NIH-MGC;
RT "National Institutes of Health, Mammalian Gene Collection (MGC)
RT http://www.ncbi.nlm.nih.gov/MGC/";
RL Unpublished.
XX
DR RZPD; IMAGp998O2410141.
DR UNILIB; 8584; 8584.
XX
CC Contact: Robert Strausberg, Ph.D.
CC Tel: (301) 496-1550
CC Email: Robert_Strausberg@nih.gov
CC Tissue Procurement: ATCC
CC cDNA Library Preparation: Life Technologies, Inc.
CC cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
CC DNA Sequencing by: Incyte Genomics, Inc.
CC Clone distribution: MGC clone distribution information can be
CC found through the I.M.A.G.E. Consortium/LLNL at:
CC http://image.llnl.gov
CC Plate: LLAM10141 row: o column: 24
CC High quality sequence stop: 700.

FH Key Location/Qualifiers
FH
FT source 1..887
FT /db_xref="taxon:9606"
FT /db_xref="RZPD:IMAGp998O2410141"
FT /db_xref="UNILIB:8584"
FT /mol_type="mRNA"
FT /note="Organ: liver; Vector: pCMV-SPORT6; Site_1: NotI;
FT Site_2: SalI; Cloned unidirectionally; oligo-dT primed.
FT Average insert size 1.7 kb. Library enriched for
FT full-length clones and constructed by Life Technologies.
FT Note: this is a NIH_MGC Library."
FT /organism="Homo sapiens"
FT /clone="IMAGE:4414319"
FT /clone_lib="NIH_MGC_90"
FT /tissue_type="adenocarcinoma, cell line"
FT /lab_host="DH10B (phage-resistant)"
XX
SQ Sequence 887 BP; 350 A; 100 C; 175 G; 262 T; 0 other;

tatctgttca atgaaaataa ggtatgaccc aagtttttac ctagtctgac tagaagtatt 60 

ccacttcaag gtctgaagta ggacttttac cttaaaaaac aacaacaaac aaaactatca 120 

cacaggatag ataagaagat tggttaaaca gttttgtgta gatctttttg gtgctgaact 180 

atgacatgag ccttatagat tgtaaaatag ggatagttgg aactaatgta cagaactaaa 240 

ttttttaaac tttatttgct gttaaattct gtgaagtttc agttatctaa aataaatata 300 



cacaaatatq aaatataatg tttcagattg caaggtaata tgtaatagta gtgtttgtaa 360
gatactctS tctaatatta actagtagta ttttgatttg tacagtcata atttgttaaa 420
ataacttca? ttaacattca ctgatgtaga ttaataatgt aagttctgat ttaaagaatg 480
gtggcaaaat ggtgcatgta atacttttgc aagtgttggg gagatcggta tgttttgaaa 540
agStaattt larttttggg tgccaggaaa tgggttttct caaagtccat t9«33<=a a t 
gqgcaggcct gcaaatactg gcacagagca ttatcataca ccttattaac ggtgagggtg 
latacctttg aaataaagtt ttagagaaat gtttcagaaa aaaaaatata atacatgtag 
atacgagaca aaaaaaaaaa aaaatgaaaa aaaataaaaa aaaaagagag ggggacagat 
atatattcag gggagagaaa aaagacagat tatagaaagg cccaaaataa aaaaagaaga 
aggggtataa atcggaaaaa tgtgtgtaag acaactgtgg agaaaac 



360 

420 

480 

540 

600 

660 

720 

780 

840 

887 



DE Human prostaglandin endoperoxide synthase mRNA, complete cds. 
KW prostaglandin endoperoxide synthase.

?T /translations "MSRSLLIiRFLLFLIiLIiPPLPVIiLADPGAPTPVNPCCYYPCQHQGI 

CVRFGIJDRYQCIDCTOTGYSGPNCriPGLVmajmSIJIPSPSFTHFIiTHGRWFWEFVNA 

TTIREMLMWJVLTVRSOT.IPSPPTYNSAHDYISWESFSNVSYYTRILPSVPKDCPTPMG 

TKGKKQLPDAQLLARRFIiIjRRKFIPDPQGTNIjMFAFFAQHFTHQFFKTSGKMGPGFTKA 

LGHGVDLGHIYGDI^ERQYQIJ^FKDGKLKYQVIiDGEMYPPSVEEAPVIiMHYPRGIPPQ 

SQMAVGQF^GLLPGI,I^YATLWLREHNRVCDI,I,KAEHPTWGDEQLFQTTRL1L1GETI 

KI VI EE YVQQLS GYFLQLKFD PELLFGVQFQYRNRIAMEFNHIiYHWHPLMPDS FKVGS 0 

EYSYEQFLFNTSMLVDYGVEALVDAFSRQIAGRIGGGRNMDHHILHVAVDVIPJESREMR 
LQPFNE YRKRFGMKP YTS PTlRT ,vni!Ki!H> kdt .tot v/wttmi t 




' — ™»« vu«jr aKyiA^KJ.UUC3KNMDHHII,HVAVDVIRESREMR 

I'QPENEYRKRFGMKPYTSFQELVGEKEMAAELEEIiYGDIDAIjEFYPGIiLLEKCHPNSIF 

* Q ESMIEIGAPFSLKGI^NPICSPEYWKPSTFGGEVGFNIVKTATLKKLVCLNTKTCPY 
T VS FRVPDASQDDGPAVERPSTEIi " 



gcgccatgag ccggagtctc ttgctccggt tcttgctgtt cctgctcctg ctcccgccgc 60
tccccgtcct gctcgcggac ccaggggcgc ccacgccagt gaatccctgt tgttactatc 120
catgccagca ccagggcatc tgtgtccgct tcggccttga ccgctaccag tgtgactgca 180
cccgcacggg ctattccggc cccaactgca ccatccctgg cctgtggacc tggctccgga 240
attcactgcg gcccagcccc tctttcaccc acttcctgct cactcacggg cgctggttct 300
gggagtttgt caatgccacc ttcatccgag agatgctcat gcgcctggta ctcacagtgc 360
gctccaacct tatccccagt ccccccacct acaactcagc acatgactac atcagctggg 420
agtctttctc caacgtgagc tattacactc gtattctgcc ctctgtgcct aaagattgcc 480
ccacacccat gggaaccaaa gggaagaagc agttgccaga tgcccagctc ctggcccgcc 540
gcttcctgct caggaggaag ttcatacctg acccccaagg caccaacctc atgtttgcct 600
tctttgcaca acacttcacc caccagttct tcaaaacttc tggcaagatg ggtcctggct 660
tcaccaaggc cttgggccat ggggtagacc tcggccacat ttatggagac aatctggagc 720
gtcagtatca actgcggctc tttaaggatg ggaaactcaa gtaccaggtg ctggatggag 780
aaatgtaccc gccctcggta gaagaggcgc ctgtgttgat gcactacccc cgaggcatcc 840
cgccccagag ccagatggct gtgggccagg aggtgtttgg gctgcttcct gggctcatgc 900
2J£2^r C gctctggcta cgtgagcaca accgtgtgtg tgacctgctg aaggctgagc 960
accccacctg gggcgatgag cagcttttcc agacgacccg cctcatcctc ataggggaga 1020
ccatcaagat tgtcatcgag gagtacgtgc agcagctgag tggctatttc ctgcagctga 1080
aatttgaccc agagctgctg ttcggtgtcc agttccaata ccgcaaccgc attgccatgg 1140
agttcaacca tctctaccac tggcaccccc tcatgcctga ctccttcaag gtgggctccc 1200
aggagtacag ctacgagcag ttcttgttca acacctccat gttggtggac tatggggttg 1260
aggccctggt ggatgccttc tctcgccaga ttgctggccg gatcggtggg ggcaggaaca 1320
S« a - catcctgcat gtggctgtgg atgtcatcag ggagtctcgg gagatgcggc 1380
tgcagccctt caatgagtac cgcaagaggt ttggcatgaa accctacacc tccttccagg 1440
agctcgtagg agagaaggag atggcagcag agttggagga attgtatgga gacattgatg 1500
cgttggagtt ctaccctgga ctgcttcttg aaaagtgcca tccaaactct atctttgggg 1560
agagtatgat agagattggg gctccctttt ccctcaaggg tctcctaggg aatcccatct 1620
gttctccgga gtactggaag ccgagcacat ttggcggcga ggtgggcttt aacattgtca 1680
agacggccac actgaagaag ctggtctgcc tcaacaccaa gacctgtccc tacgtttcct 1740
tccgtgtgcc ggatgccagt caggatgatg ggcctgctgt ggagcgacca tccacagagc 1800
tctgaggggc aggaaagcag cattctggag gggagagctt tgtgcttgtc attccagagt 1860
gctgaggcca gggctgatgg tcttaaatgc tcattttctg gtttggcatg gtgagtgttg 1920
gggttgacat ttagaacttt aagtctcacc cattatctgg aatattgtga ttctgtttat 1980
tcttccagaa tgctgaactc cttgttagcc cttcagattg ttaggagtgg ttctcatttg 2040
gtctgccaga atactgggtt cttagttgac aacctagaat gtcagatttc tggttgattt 2100
gtaacacagt cattctagga tgtggagcta ctgatgaaat ctgctagaaa gttagggggt 2160
tcttattttg cattccagaa tcttgacttt ctgattggtg attcaaagtg ttgtgttccc 2220
tggctgatga tccagaacag tggctcgtat cccaaatctg tcagcatctg gctgtctaga 2280
atgtggattt gattcatttt cctgttcagt gagatatcat agagacggag atcctaaggt 2340
ccaacaagaa tgcattccct gaatctgtgc ctgcactgag agggcaagga agtggggtgt 2400
tcttcttggg acccccacta agaccctggt ctgaggatgt agagagaaca ggtgggctgt 2460
attcacgcca ttggttggaa gctaccagag ctctatcccc atccaggtct tgactcatgg 2520
cagctgtttc tcatgaagct aataaaattc gccc 2554



DE Human mRNA for tyrosine hydroxylase type 3



/translation="MPTPDATTPQAKGFRRAVSEIJDAKQAEAIMGAPQPSLTGSPWPGT
AAPAASYTPTPRSPRFIGRRQSLIEDARKEREAAVAAAAAAVPSEPGDPLEAVAFEEKE
GKAVLNLLFS PRATKPSALSRAVKVFETFEAKIHHLETRPAQRPRAGGPHLEYFVRLEV
RRGDLAALLSGVRQVSEDVRSPAGPKVPWPPRKVSEIjDKCHHLVTKFDPDLDLDHPGFS
DQVyRQRRKXi I AE IAFQ YRHGDP I PRVEYTAEE I ATWKEVYTTLKGL YATHACGEHLEA
FALLERFSGYREDNIPQLEDVSRFLKERTGFQIiRPVAGLLSARDPLASLAFRVFQCTQY
IRHASSPMHSPEPDCCHELLGHVPMLADRTFAQFSQDIGIiASLGASDEEIEKIiS'njSWF
TVEFGLCKQNGEVKAYGAGLLSSYGEliLHCLSEEPEIRAFDPEAAAVQPYQDQTYQSVY
WSESFSDAKDKI^SYASRIQRPFSVKFDPYTLAIDVLDSPQAVRRSLEGVQDELDTLA
HALSAIG"



tccacactga gccatgccca cccccgacgc caccacgcca caggccaagg gcttccgcag 60 

ggccgtgtct gagctggacg ccaagcaggc agaggccatc atgggcgccc cggggcccag 120

cctcacaggc tctccgtggc ctggaactgc agccccagct gcatcctaca cccccacccc 180 

aaggtccccg cggttcattg ggcgcaggca gagcctcatc gaggacgccc gcaaggagcg 240

ggaggcggcg gtggcagcag cggccgctgc agtcccctcg gagcccgggg accccctgga 300 

ggctgtggcc tttgaggaga aggaggggaa ggccgtgcta aacctgctct tctccccgag 360 

ggccaccaag ccctcggcgc tgtcccgagc tgtgaaggtg tttgagacgt ttgaagccaa 420 

aatccaccat ctagagaccc ggcccgccca gaggccgcga gctgggggcc cccacctgga 480 

gtacttcgtg cgcctcgagg tgcgccgagg ggacctggcc gccctgctca gtggtgtgcg 540 

ccaggtgtca gaggacgtgc gcagccccgc ggggcccaag gtcccctggt tcccaagaaa 600 

agtgtcagag ctggacaagt gtcatcacct ggtcaccaag ttcgaccctg acctggactt 660 

ggaccacccg ggcttctcgg accaggtgta ccgccagcgc aggaagctga ttgctgagat 720 

cgccttccag tacaggcacg gcgacccgat tccccgtgtg gagtacaccg ccgaggagat 780 

tgccacctgg aaggaggtct acaccacgct gaagggcctc tacgccacgc acgcctgcgg 840 

ggagcacctg gaggcctttg ctttgctgga gcgcttcagc ggctaccggg aagacaatat 900 

cccccagctg gaggacgtct cccgcttcct gaaggagcgc acgggcttcc agctgcggcc 960 

tgtggccggc ctgctgtccg cccgggactt cctggccagc ctggccttcc gcgtgttcca 1020 

gtgcacccag tatatccgcc acgcgtcctc gcccatgcac tcccctgagc cggactgctg 1080 

ccacgagctg ctggggcacg tgcccatgct ggccgaccgc accttcgcgc agttctcgca 1140 

ggacattggc ctggcgtccc tgggggcctc ggatgaggaa attgagaagc tgtccacgct 1200 

gtcatggttc acggtggagt tcgggctgtg taagcagaac ggggaggtga aggcctatgg 1260 

tgccgggctg ctgtcctcct acggggagct cctgcactgc ctgtctgagg agcctgagat 1320 

tcgggccttc gaccctgagg ctgcggccgt gcagccctac caagaccaga cgtaccagtc 1380 

agtctacttc gtgtctgaga gcttcagtga cgccaaggac aagctcagga gctatgcctc 1440 

acgcatccag cgccccttct ccgtgaagtt cgacccgtac acgctggcca tcgacgtgct 1500 

ggacagcccc caggccgtgc ggcgctccct ggagggtgtc caggatgagc tggacaccct 1560 

tgcccatgcg ctgagtgcca ttggctaggt gcacggcgtc cctgagggcc cttcccaacc 1620 

tcccctggtc ctgcactgtc ccggagctca ggccctggtg aggggctggg tcccgggtgc 1680 

cccccatgcc ctccctgctg ccaggctccc actgcccctg cacctgcttc tcagcgcaac 1740 

agctgtgtgt gcccgtggtg aggttgtgct gcctgtggtg aggtcctgtc ctggctccca 1800 

gggtcctggg ggctgctgca ctgccctccg cccttccctg acactgtctg ctgccccaat 1860
caccgtcaca ataaaagaaa ctgtggtctc t 1891



Homo sapiens mRNA; cDNA DKFZp566A093 (from clone DKFZp566A093); complete cds

/translation="MYQTPMEVAVYQLHNFSISFFSSLLGGDVVSVKLDNSASGASVVA
IDNKIEQAMDLVKNHLMYAVREEVEILKEQIRELVEKNSQLERENTLLKTLASPEQLEK
FQSCLSPEEPAPESPQVPEAPGGSAV"

agtctgggtt ggactggcgg ccgtggagtt tgtgacatac gaggtgacac ccctcgagtc 60
acttcccttc aactccagct ggagcgcctg cttggctttg ggttcgttct gcagccttcg 120
ccccgctcct agcctcaggg ccggactccg gcgcagagcc cagcccagcg cagcctgcca 180
gcagccaccc agccgcccag ccgcccagcc ccgcacgaaa cccggccaga gcttcctagc 240
agcccgagcc atgaacaccg aatgtatcag acccccatgg aggtggcggt ctaccagctg 300
cacaatttct ccatctcctt cttctcttct ctgcttggag gggatgtggt ttccgttaag 360
ctggacaaca gtgcctccgg agccagcgtg gtggccatag acaacaagat cgaacaggcc 420
atggatctgg tgaagaatca tctgatgtat gctgtgagag aggaggtgga gatcctgaag 480
gagcagatcc gagagctggt ggagaagaac tcccagctag agcgtgagaa caccctgttg 540
aagaccctgg caagcccaga gcagctggag aagttccagt cctgtctgag ccctgaagag 600
ccagctcccg aatccccaca agtgcccgag gcccctggtg gttctgcggt gtaagtggct 660
ctgtcctcag ggtgggcaga gccactaaac ttgttttacc tagttctttc cagtttgttt 720
ttggctcccc aagcatcatc tcacgaggag aactttacac ctagcacagc tggtgccaag 780
agatgtccta aggacatggc acctgggtcc actccagcga cagacccctg acaagagcag 840
gtctctggag gctgagttgc atggggccta gtaacaccaa gccagtgagc ctctaatgct 900
actgcgccct gggggctccc agggcctggg caacttagct gcaactggca aaggagaagg 960
gtagtttgag gtgtgacacc agtttgctcc agaaagttta aggggtctgt ttctcatctc 1020
catggacatc ttcaacagct tcacctgaca acgactgttc ctatgaagaa gccacttgtg 1080
ttttaagcag aggcaacctc tctcttctcc tctgtttcgt gaaggcaggg gacacagatg 1140
ggagagattg agccaagtca gccttctgtt ggttaatatg gtataatgca tggctttgtg 1200
cacagcccag tgtgggatta cagctttggg atgaccgctt acaaagttct gtttggttag 1260
tattggcata gtttttctat atagccataa atgcgtatat atacccatag ggctagatct 1320
gtatcttagt gtagcgatgt atacatatac acatccacct acatgttgaa gggcctaacc 1380
agccttggga gtattgactg gtcccttacc tcttatggct aagtctttga ctgtgttcat 1440
ttaccaagtt gacccagttt gtcttttagg ttaagtaaga ctcgagagta aaggcaagga 1500
ggggggccag cctctgaatg cggccacgga tgccttgctg ctgcaaccct ttccccagct 1560
gtccactgaa acgtgaagtc ctgttttgaa tgccaaaccc accattcact ggtgctgact 1620
acatagaatg gggttgagag aagatcagtt tgggcttcac agtgtcattt gaaaacgttt 1680
tttgttttgt tttgtaatta ttgtggaaaa ctttcaagtg aacagaagga tggtgtccta 1740
ctgtggatga gggatgaaca aggggatggc tttgatccaa tggagcctgg gaggtgtgcc 1800
cagaaagctt gtctgtagcg ggttttgtga gagtgaacac tttccacttt ttgacacctt 1860
atcctgatgt atggttccag gatttggatt ttgattttcc aaatgtagct tgaaatttca 1920
ataaactttg ctctgttttt ctaaaaataa aaaaaaaaaa aaaaaaaa



DE Homo sapiens mRNA for Id-1H, complete cds.

FT /translation="MKVASGSTATAAAGPTCALKAGKTASGAGEVVRCLSEQSVAISRC
FT RGAGARLPALLDEQQVNVLLYDMNGCYSRLKELVPTLPQNRKVSKVEILQHVIDYIRDL
FT QLELNSESEVGTPGGRGLPVRAPLSTLNGEISALTAEAACVPADDRILCR"

ttcagccagt cgccaagaat catgaaagtc gccagtggca gcaccgccac cgccgccgcg 60 

ggccccacgt gcgcgctgaa ggccggcaag acagcgagcg gtgcgggcga ggtggtgcgc 120 

tgtctgtctg agcagagcgt ggccatctcg cgctgccggg gcgccggggc gcgcctgcct 180 

gccctgctgg acgagcagca ggtaaacgtg ctgctctacg acatgaacgg ctgttactca 240 

cgcctcaagg agctggtgcc caccctgccc cagaaccgca aggtgagcaa ggtggagatt 300 

ctccagcacg tcatcgacta catcagggac cttcagttgg agctgaactc ggaatccgaa 360 

gttggaaccc ccgggggccg agggctgccg gtccgggctc cgctcagcac cctcaacggc 420 

gagatcagcg ccctgacggc cgaggcggca tgcgtccctg cggacgatcg catcttgtgt 480 
cgctgaaggc cttccccagg gaccggcgg 509



Homo sapiens mRNA for KIAA1254 protein, partial cds.

/translation=»KMSENSSDSDSSCGWTVISHEGSDIEMLNSVTPTDSCEPAPECSS 
LEQEELQALQIEQGESSQNGTVLMEETAYPALEETSSTIEAEEQKIPEDSIYIGTASDD 
SD IVTLEPPKLEEIGNQEWIVEEAQS SEDFNMGS SSS SQYTFCQPETVFSSQPSDDES 
S SDETSNQPS P AFRRRRARKKTVS ASESEDRliVGEQETEPS KEIiSKRQFS S GUJKCVILi 
ALVIAISMGFGHFYGTIQIQKRQQLVMCIHEDEIJroMKDYLSQCQQEQESFIDYKSLKE 
NI1ARCWTLTEAEKMSFETQKTNI1ATENQYI1RVSLEKEEKAL 

DKGTSTELVKENQKIjKQHLEEEKQKKHSFLSQRETLLTEAKMLKRELERERLVTTAIiRG 
ELQQIjSGSQIjHGKSDSPNVYTEKKEIAILRERIjTELERKLTFEQQRSDLWERIiYVEAKD 
QNGKQGTDGKKKGGRGSHRAKNKSKETFIiGSVKETFDAMKNSTKEFVRHHKEKIKQAKE 
AVKEinjKKFSDSVKSTFRHFKDTTKNIFDEKGNKRFGATKEAAEKPRTVFSDYLHPQYK 
APTENHHNRGPTMQ1TOGRKEKPVHFKEFRKNTNSKKCSPGHDCRENSHSFRKACSGVFD 

CAQQESMSLFNTWNP IRMDEFRQ I IQRYMLKEIiDTFCHWNEIiDQF INKFFLNGVF IHD 
QKLFTDFVinDVKDYIiRNMKEyEVDNDGVFEKIjDEyiYRHFFGBTFSPPYGPRSVyiKPC 



HYSSL 



cattggcgcc cgagctgtga ccgccgccac tggggcagcc agcacaatcg ggcggaggtg 60
gcgctgcccc ttcagacctg aaagatgtct gaaaattcca gtgacagtga ttcatcttgt 120
ggttggactg tcatcagtca tgaggggtca gatatagaaa tgttgaattc tgtgaccccc 180
actgacagct gtgagcccgc cccagaatgt tcatctttag agcaagagga gcttcaagca 240
ttgcagatag agcaaggaga aagcagccaa aatggcacag tgcttatgga agaaactgct 300
tatccagctt tggaggaaac cagctcaaca attgaggcag aggaacaaaa gatacccgaa 360
gacagtatct atattggaac tgccagtgat gattctgata ttgttaccct tgagccacct 420
aagttagaag aaattggaaa tcaagaagtt gtcattgttg aagaagcaca gagttcagaa 480
gactttaaca tgggctcttc ctctagcagc cagtatactt tctgtcagcc agaaactgta 540
ttttcatctc agcctagtga tgatgaatca agtagtgatg aaaccagtaa tcagcccagt 600
cctgccttta gacgacgccg tgctaggaag aagaccgttt ctgcttcaga atctgaagac 660
ccjqctagttg gtgaacaaga aactgaacct tctaaggagt tgagtaaacg tcagttcagt 720
agtggtctca ataagtgtgt tatacttgct ttggtgattg caatcagcat gggatttggc 780
catttctatg gcacaattca gattcagaag cgtcaacagt tagtcagaaa gatacatgaa 840
gatgaattga atgatatgaa ggattatctt tcccagtgtc aacaggaaca agaatctttt 900
atagattata agtcattgaa agaaaatctt gcaaggtgtt ggacacttac tgaagcagag 960
aagatgtcct ttgaaactca gaaaacgaac cttgctacag aaaatcagta tttaagagta 1020
tccctggaga aggaagaaaa agccttatcc tcattacagg aagagttaaa caaactaaga 1080
gaacagatta gaatattgga agataaaggg acaagtactg aattagttaa agaaaatcag 1140
aaacttaagc agcatttgga agaggaaaag cagaaaaaac acagctttct tagtcaaagg 1200
gagactctgt tgacagaagc aaagatgcta aagagagaac tggagagaga acgactagta 1260
actacggctt taagggggga actccagcag ttaagtggta gtcagttaca tggcaagtca 1320
gattctccca atgtatatac tgaaaaaaag gaaatagcaa tcttacggga aagactcact 1380
gagctggaac ggaagctaac cttcgaacag cagcgttctg atttgtggga aagattgtat 1440
gttgaggcaa aagatcaaaa tggaaaacaa ggaacagatg gaaaaaagaa agggggcaga 1500
ggaagccaca gggctaaaaa taagtcaaag gaaacatttt tgggttcagt taaggaaaca 1560
tttgatgcca tgaagaattc taccaaggag tttgtaaggc atcataaaga gaaaattaag 1620
caggctaaag aagctgtgaa ggaaaatctg aaaaaattct cagattcagt taaatccact 1680
ttcagacact ttaaagatac caccaagaat atctttgatg aaaagggtaa taaaagattt 1740
ggtgctacaa aagaagcagc tgaaaaacca agaacagttt ttagtgacta tttacatcca 1800
cagtataagg cacctacaga aaaccatcat aatagaggcc ctactatgca aaatgatgga 1860
aggaaagaaa agccagttca ctttaaagaa ttcagaaaaa atacaaattc aaagaaatgc 1920
agtcctgggc atgattgtag agaaaattct cattctttca gaaaggcttg ttctggtgta 1980
tttgattgtg ctcaacaaga gtccatgagc ctttttaaca cagtggtgaa tcctataagg 2040
atggatgaat ttagacagat aattcaaagg tacatgttaa aagaactgga tactttttgt 2100
cactggaacg aacttgatca gttcatcaat aagtttttcc taaacggtgt ctttatacat 2160
gatcagaagc tcttcactga ctttgttaat gatgttaaag attatcttag aaa « a ^ag 2220
gaatatgaag tagataatga tggagtattt gagaagttgg atgaatatat atatagacac 2280
ttctttggtc acactttttc ccctccatat ggacccaggt cggtttacat aaaaccgtgt 2340
cattacagta gtttgtaaca tttgtagatt ggatagcatt tttatgattt gatgagtttc 2400



ttgtaaggtt accgtttcta agagttgtgc tttatggcca ctgagagaat tcagaataaa 2460 

ttgaaagatg gagtctaaaa attattagct gttacaaatg gaacatttca ttataacgtg 2520 

atcactttga cttgagcaaa tggtttaatt tttatcttaa aaatcagtta agaatatata 2580 

aaatcctact ttggccaagt ttgtttcttt tcattatagt ttatatgaaa agatcacctt 2640

aagtgaaatt attttccttt aatcttttat gtatttattc acttttggaa gctaggaatg 2700 

agcaacacaa attttactct gaagtcagaa gagctcatat ataataattc taatgtccca 2760 

cctattttca cttgtccatt ccatgtacca gcttagttat gatacttagt cacataatta 2820

tctttgataa aggtagaggc acaaagaggc aaactaagca agtcaaattc taatgtgtgt 2880 

acttcataat aattttttat ccattttcat ctttatattc tgtaacatga aacttaccta 2940 

atcttcaaat gttagcttca ttttttacct ttgaaatact taatctttct gaataaatat 3000 

aatgtgtcta taaaataatg agactgattc tggtgtcttt agttattaag ctggtatcta 3060 

gtcctataat gaacaaaggt gaagctgcct tgaggagaca agtgaaaaat ttttgcttca 3120

aaggagctca caagctaagt aaataaatga aattaaggta tggggcatgg tggcctcagg 3180 

ctgtctggag gtgtttggaa aggcttcttg agtgaggtgg cctttgaact gaacttagtt 3240 

tttaaagtag cttttggaag agaaatgagg atttgctatg cagacaggga agggaatttc 3300

acttaaaagg aaggtcattt ggagatgtga agatacactg ctttaaggaa gcagggtaga 3360 

gctggaggat aagagatgca gaccatgaag ggccccattt tatgctaaag gttttgtcct 3420

gtaggacatg gagaacttct gaagaatttt caaggcgggt gggataagat tatattgtat 3480 

tttagattac agtagtcccc ccttatcttc aggatatatg ttccaagacc cccagtggat 3540 

gctggaaacc agggatagaa cataattcta tatatactat gcatgaattt ctttttcctt 3600 

ctttacaatc tcacacatag gtttgttctt actatagatc ttaccaatct cagcatactt 3660 

ttatttctct tgagaacctt caccctttca cttaaaggag gcgctttata gcttctcttt 3720

ggcatatcca aatgccagca tcactgttgt attttggggt cattattaag ttacttaatc 3780 

atccttaatc cttatcttag ggatacttga acacaaacac tggtaggata acagtatatc 3840 

tgattaacag actgctacta ggtgattaat gggtgggtag tgtaaataca caagaaaagg 3900 

atgattcaca tcccatgtgg gatggagcag aactgcatta tttcattaca ttactcagaa 3960 

caggcatata attgaaaact tatgaattat tattttttaa ttatttgaga tggaatcttg 4020 

ctttgtcagc caggctggag tgcagtgaca cgatctcagc tcactgcaac ctctgtctcc 4080 

tgggttcagg tgattctcct ccctagtctc ccaagtagct gggactatag gcacgtgcca 4140

ccacacccgg ctaattttca tatttttagt agagatgggg tttcaccatg ttggccaggc 4200 

tgttcttgaa atcctgacct caagtgatcc acacatgtct acctcccaaa gtgctgggat 4260 

tacagtcgtg agccactgta ccccgcctaa aactgatgaa ttatttctga aattttctat 4320 

ttaacatttt cagaccacag ttgaccacag gtaacggaaa cctcaatcac agaaagtaaa 4380 

gccgtggata cggtgggact aatgtattgg tagcagccta gaggattgat gggaaaggta 4440

tgaagctaga aggtggtcaa tataatacag acatgagctg atgaacatct aaactgggac 4500 

tatactagta ggagaggaaa ggaaaaaaca tttggaaaat agtaacattg atatttcttg 4560 

tgaaggagaa gtagaaagta acagtgactt ctagatttct gggttgggtc atctgttgtt 4620

ggatagtagt accactgaga tagggaattc aaggtttggg gcaagggtaa ttggagatga 4680 

gaattgtgtt tggaggtaac tactgacatt caagtggaga gggttagttg gcagttagtt 4740 

ctatggtcat ctcttttgcc gagactgtat atttatcaga ctcctgggag aacaccaaca 4800 

tccatggggt tgtagggaag gctaaggaca agagtgggga gtggtacctt gaaaatccaa 4860 

aagccatctc aagtaaaagg aataaatgtg tcatgctttt taaaaagttg atgtgcggaa 4920

aatgttttct tggcttggaa actgggcggc caggggatga cagtatggac ttccagtgaa 4980 

gtagtgacgg aagcctgatc atagacatta aggaaagcgg tgtaggtgtt gtgagctttt 5040 

gctgtaagaa aaagttgaga cttttgtttt gctttgtttg tgagagatgt gtatgtattt 5100 

ctgctgagtg ataaagccag cggggaggga ctgattttta taggaaagga ggaaaaataa 5160 

tggaaacaca tctcattatt ttattgtcac atttcttttc tttgttatct tttgagtgtt 5220

tccctttttt gccagtagag ttattgtcta ttttttcttt ctataggaca aaaaaactaa 5280 

tacagactcc tttattttta tatggatata ctaggattgt aattcagata tttaatatct 5340 

tttatcagtg ttcagatcat agattaatgg agaaaacatt taaaattgtt ttaaatttaa 5400 

atacattgaa ctctaacata gatgaaaaat gtgtttactg ctttcagtcg acctgataaa 5460 

aagcaacgta tggtaaatat tgaaaactcc aggcatcgaa aacaagagca gaagcacctt 5520 

cagccacagc cttataaaag ggaaggtaaa tggcataaat atggtcgcac taatggaaga 5580 

caaatggcaa atcttgaaat agaattgggg caattacctt ttgatcctca atactgattc 5640 

acaattgagt taaattagac aactgtaaga gaaaaattta tgctttgtat aatgtttggt 5700 

attgaaacta atgaaattac caagatgaca atgtcttttc ttttgtttct aagtatcagt 5760 

ttgataactt tatattattc ctcagaagca ttagttaaaa gtctactaac ctgcattttc 5820 



ctgtagttta gcttcgttga attttttttg acactggaaa tgttcaactg tagttttatt 5880
aaggaagcca ggcatgcaac agattttgtg catgaaatga gacttccttt cagtgtaaga 5940
gcttaaagca agctcagtca tacatgacaa agtgtaatta acactgatgt ttgtgttaaa 6000
tttgcagcag agcttgagaa aagtacattg ttctggaatt tcatcattaa cattttataa 6060
tcttacactc acttcttgtc tttttgtggg ttcaagagcc ctctgacttg tgaagaattt 6120
gctgccctct taagagcttg ctgacttgtt ttcttgtgaa attttttgca catctgaata 6180
tcgtggaaga aacaataaaa ctacaccatg agg 6213



DE Homo sapiens cDNA clone:HEMBA1001328, 3' end, expressed in whole embryo,
DE mainly head.

gtagccttta tttacttaaa catttatttg cttctaggaa ataagcgctt tcctaatttc 60 

aagcaattat aaaagaactg ctgttttctt ccacactcac ttgccagagg gtcgaattgg 120 

aagtcacata tatgtctatg aacggaagtt aaaagggaaa ttcaacatga agatgaaatt 180 

ctgaactttc ctagataaat taacattgct gggtggaaat attcagatgc tgcttaaata 240 

cttcggtaaa cactgggtaa gattcatgga acttagaaaa aagctgtatg aactgcttta 300 

ccaaatatca ctactgagga aatgtataaa ataccacata gtataaaatt acatgttaat 360 

ccaatgccag attttaaata aaggacctta agttttcctc aagggggaag tttaatgggt 420 

cnttcccgnt ntcanagggc caaaaanttc ccaaggaaac caggtagnaa gctcttnaaa 480 

ggccgcaaaa t 491



Homo sapiens mRNA; cDNA DKFZp564F1862 (from clone DKFZp564F1862); complete cds

/translation="MATPQSIFIFAICILMITELILASKSYYDILGVPKSASERQIKKA
FHKLAMKYHPDKNKSPDAEAKFREIAEAYETLSDANRRKEYDTLGHSAFTSGKGQRGSG
SSFEQSFNFNFDDLFKDFGFFGQNQNTGSKKRFENHFQTRQDGGSSRQRHHFQEFSFGG
GLFDDMFEDMEKMFSFSGFDSTNQHTVQTENRFHGSSKHCRTVTQRRGNMVTTYTDCSG
Q"



gaggcttctg aggtggtggc gccagcggct acctcctgcc tgtgaggagc tggctgagag 60
gggactgggc gccggcgggg aaggaggagc gctaggtcgg tgtacgaccg agattagggt 120
gcgtgccagc tccgggaggc cgcggtgagg ggccgggccc aagctgccga cccgagccga 180
tcgtcagggt cgccagcgcc tcagctctgt ggaggagcag cagtagtcgg agggtgcagg 240
atattagaaa tggctactcc ccagtcaatt ttcatctttg caatctgcat tttaatgata 300
acagaattaa ttctggcctc aaaaagctac tatgatatct taggtgtgcc aaaatcggca 360
tcagagcgcc aaatcaagaa ggcctttcac aagttggcca tgaagtacca ccctgacaaa 420
aataagagcc cagatgctga agcaaaattc agagagattg cagaagcata tgaaacactc 480
tcagatgcta atagacgaaa agagtatgat acacttggac acagtgcttt tactagtggt 540
aaaggacaaa gaggtagtgg aagttctttt gagcagtcat ttaacttcaa ttttgatgac 600
ttatttaaag actttggctt ttttggtcaa aaccaaaaca ctggatccaa gaagcgtttt 660
gaaaatcatt tccagacacg ccaggatggt ggttccagta gacaaaggca tcatttccaa 720
gaattttctt ttggaggtgg attatttgat gacatgtttg aagatatgga gaaaatgttt 780
tcttttagtg gttttgactc taccaatcag catacagtac agactgaaaa tagatttcat 840
ggatctagca agcactgcag gactgtcact caacgaagag gaaatatggt tactacatac 900
actgactgtt caggacagta gttcttattc tattctcact aaatccaact ggttgactct 960
tcctcattat ctttgatgct aaacaatttt ctgtgaacta ttttgacaag tgcatgattt 1020
cactttaaac aatttgatat agctattaag tatatttaag ggtttttttt ttttgacaaa 1080
ttcaacattc aacgagtaga caaaatgcta attatttccc tgattaggaa agtttcttta 1140
aaaaacacgt aattttgcct agtgcttttt ctctacctgc ccttgggctc actaatatca 1200
ccagtattat taccaagaaa atattgagtt tacctgatta aactttaaaa gttaattgta 1260
gatttaaatt gtgtgaacct aatgattttt gcagtgaaac ctttactaat tcaaagttgc 1320
atgttctatg acatctgtga cttgcgttgc agagtgtaca tgaaactgta taattgagtc 1380
attcagtaaa ggagaacagt atcttggtta attgctactg aaaggttgag aaaggaatgg 1440
tttgatattt accacagcgc tgtgcctttc tacagtagaa ctggggtaaa ggaaatggtt 1500
ttattgccca tagtcattta ggctggaaaa aagttgaaaa cttaacgaaa tattgccaag 1560
agattgttat gtgtttggtt ccagcctaaa aatgattttg tagtgttgaa atcatagcta 1620
cttacatagc tttttcatat ttctttctta gttgttggca ctcttaggtc ttagtatgga 1680
tttatgtgtt tgtgtgtgtg tagtttatcc tctctctcat ctttatctag agattgactg 1740
atacctcatt ctgtttgtaa aaccagccag taatttctgt gcaaccttac tatgtgcaat 1800
atttttaaat cctgagaaat gtgtgctttt gttttcggat agacttattt ctttagttct 1860
gcacttttcc acattatact ccatatgagt attaatccta tggatacata ttaaaacaag 1920
tgtctcatac aacattgtat gtgagagaaa tataaatatt tacaacctaa aaaaaaaaaa 1980
aaaaaaa



Homo sapiens annexin A1, mRNA (cDNA clone MGC:5095 IMAGE:3459615), complete cds.



/ translation " MAMVSEFLKQAWFIENEEQBYVQTVKS S KGGPGS AVS PYPTFNPS 

EEVVLALLKTPAQFDADELRAAMKGLGTDEDTLIE ILASRTNKE IRDINRVYREELKRD 
Ij ^ I> ™ ^^^^ ^ D FRNALLS LAKGDRS ED FGVNEDLAD SD ARAL YEAGERRKGTDVITVPN 
TILTTRSYPQLRRVFQKYTKYSKHDMNKVLDLELKGDIEKCIjTAIVKC^ 
LHQAMKGVGTRHKALIRIMTO^^ 



ALCGGN" 



atttctcttt agttctttgc aagaaggtag agataaagac actttttcaa aaatggcaat 60
ggtatcagaa ttcctcaagc aggcctggtt tattgaaaat gaagagcagg aatatgttca 120
aactgtgaag tcatccaaag gtggtcccgg atcagcggtg agcccctatc ctaccttcaa 180
tccatcctcg gatgtcgctg ccttgcataa ggccataatg gttaaaggtg tggatgaagc 240
aaccatcatt gacattctaa ctaagcgaaa caatgcacag cgtcaacaga tcaaagcagc 300
atatctccag gaaacaggaa agcccctgga tgaaacactg aagaaagccc ttacaggtca 360
ccttgaggag gttgttttag ctctgctaaa aactccagcg caatttgatg ctgatgaact 420
tcgtgctgcc atgaagggcc ttggaactga tgaagatact ctaattgaga ttttggcatc 480
aagaactaac aaagaaatca gagacattaa cagggtctac agagaggaac tgaagagaga 540
tctggccaaa gacataacct cagacacatc tggagatttt cggaacgctt tgctttctct 600
tgctaagggt gaccgatctg aggactttgg tgtgaatgaa gacttggctg attcagatgc 660
cagggccttg tatgaagcag gagaaaggag aaaggggaca gacgtaaacg tgttcaatac 720
catccttacc accagaagct atccacaact tcgcagagtg tttcagaaat acaccaagta 780
cagtaagcat gacatgaaca aagttctgga cctggagttg aaaggtgaca ttgagaaatg 840
cctcacagct atcgtgaagt gcgccacaag caaaccagct ttctttgcag agaagcttca 900
tcaagccatg aaaggtgttg gaactcgcca taaggcattg atcaggatta tggtttcccg 960
ttctgaaatt gacatgaatg atatcaaagc attctatcag aagatgtatg gtatctccct 1020
ttgccaagcc atcctggatg aaaccaaagg agattatgag aaaatcctgg tggctctttg 1080
tggaggaaac taaacattcc cttgatggtc tcaagctatg atcagaagac tttaattata 1140
tattttcatc ctataagctt aaataggaaa gtttcttcaa caggattaca gtgtagctac 1200
ctacatgctg aaaaatatag cctttaaatc atttttatat tataactctg tafcaatagag 1260
ataagtccat tttttaaaaa tgttttcccc aaaccataaa accctataca agttgttcta 1320
gtaacaatac atgagaaaga tgtctatgta gctgaaaata aaatgacgtc acaagacaaa 1380
aaaaaaaaaa aaaaaaaaaa aaaaaaaa 1408




DE Homo sapiens peroxisomal D3,D2-enoyl-CoA isomerase, mRNA (cDNA clone
MGC:3558 IMAGE:3608151), complete cds.

/ trans lat ion= " MRASQKDFBNSMNQVKLLKKDPGNEVKLKIjYAIiyKQATEGPCNMP 
KPGVFDLINKAKWDAWNALGSLPKEAARQNYVDLVS SLSPSLES SS QVEPGTDRKSTGF 
ETLVVTSEDGITKIMFNRPKKKNAINTEMYHEIMRAIjKAASKDDS I ITVIjTGNGDYYS S 
GNDLTOTTDIPPGGVEEKAKNNAVLLI^ 

DAVYASDRATFHTPFSHLGQS PBGC S S YTFPKIMS P AKATEMLI FGKKLTAGEACAQGIi 
VTEVFPDSTFQKEVWTRIjKAFAKIiPPNAIjRISKEVIRKREREKLH^ 
IiSDECTNAWNFLSRKSKL n 

gagccgccca agggatggcg atggcgtact tggcttggag actggcgcgg cgttcgtgtc 60
cgagttctct gcaggtcact agtttcccgg tagttcagct gcacatgaat agaacagcaa 120
tgagagccag tcagaaggac tttgaaaatt caatgaatca agtgaaactc ttgaaaaagg 180
atccaggaaa cgaagtgaag ctaaaactct acgcgctata taagcaggcc actgaaggac 240
cttgtaacat gcccaaacca ggtgtatttg acttgatcaa caaggccaaa tgggacgcat 300
ggaatgccct tggcagcctg cccaaggaag ctgccaggca gaactatgtg gatttggtgt 360
ccagtttgag tccttcattg gaatcctcta gtcaggtgga gcctggaaca gacaggaaat 420
caactgggtt tgaaactctg gtggtgacct ccgaagatgg catcacaaag atcatgttca 480
accggcccaa aaagaaaaat gccataaaca ctgagatgta tcatgaaatt atgcgtgcac 540
ttaaagctgc cagcaaggat gactcaatca tcactgtttt aacaggaaat ggtgactatt 600
acagtagtgg gaatgatctg actaacttca ctgatattcc ccctggtgga gtagaggaga 660
aagctaaaaa taatgccgtt ttactgaggg aatttgtggg ctgttttata gattttccta 720
agcctctgat tgcagtggtc aatggtccag ctgtgggcat ctccgtcacc ctccttgggc 780
tattcgatgc cgtgtatgca tctgacaggg caacatttca tacaccattt agtcacctag 840
gccaaagtcc ggaaggatgc tcctcttaca cttttccgaa gataatgagc ccagccaagg 900
caacagagat gcttattttt ggaaagaagt taacagcggg agaggcatgt gctcaaggac 960
ttgttactga agttttccct gatagcactt ttcagaaaga agtctggacc aggctgaagg 1020
catttgcaaa gcttccccca aatgccttga gaatttcaaa agaggtaatc aggaaaagag 1080
agagagaaaa actacacgct gttaatgctg aagaatgcaa tgtccttcag ggaagatggc 1140
tatcagatga atgcacaaat gctgtggtga acttcttatc cagaaaatca aaactgtgat 1200
gaccactaca gcagagtaaa gcatgtccaa ggaaggatgt gctgttacct ctgatttcca 1260
gtactggaac taaataagct tcattgtgcc ttttgtagtg ctagaatatc aattacaatg 1320
atgatatttc actacagctc tgatgaataa aaagttttgt aaaacaaaaa aaaaaaaaaa 1380
aaa 1383






/translation="MGRPRPRAAKTWMFLLLIiGGAWAGH^ 

WQAALFQGQQLLCGGVLVCTCNWVL^ 

PHPCYNSSDVEDHNHDLMIJL,QI^ 




cgcccctcgt gatgtcaggg gcgcagtagc tccgcccacg tggagctcgg gcggtgtaga 60
gctcagcccc ttgtggcccc gtcctgggcg tgtgctgggt ttgaatcctg gcggagacct 120
ggggggaaat tgagggaggg tctggatacc tttagagcca atgcaacgga tgatttttca 180
gtaaacgcgg gaaacctcac cttctttctg cctgagctgt gagatgagtg gagagcaaac 240
gggtgggcgg tgaagggcag atgagggaac cggtaccgcc ttgcaactcc cccttaaacc 300
cctatgttcc agttcccaga agctccccag gctctagtgc aggaggagaa ggaggaggag 360
caggaggtgg agattcccag ttaaaaggct ccagaatcgt gtaccaggca gagaactgaa 420
gtactggggc ctcctccact gggtccgaat cagtaggtga ccccgcccct ggattctgga 480
agacctcacc atgggacgcc cccgacctcg tgcggccaag acgtggatgt tcctgctctt 540
gctgggggga gcctgggcag gacactccag ggcacaggag gacaaggtgc tggggggtca 600
tgagtgccaa ccccattcgc agccttggca ggcggccttg ttccagggcc agcaactact 660
ctgtggcggt gtccttgtag gtggcaactg ggtccttaca gctgcccact gtaaaaaacc 720
gaaatacaca gtacgcctgg gagaccacag cctacagaat aaagatggcc cagagcaaga 780
aatacctgtg gttcagtcca tcccacaccc ctgctacaac agcagcgatg tggaggacca 840
caaccatgat ctgatgcttc ttcaactgcg tgaccaggca tccctggggt ccaaagtgaa 900
gcccatcagc ctggcagatc attgcaccca gcctggccag aagtgcaccg tctcaggctg 960
gggcactgtc accagtcccc gagagaattt tcctgacact ctcaactgtg cagaagtaaa 1020
aatctttccc cagaagaagt gtgaggatgc ttacccgggg cagatcacag atgtcatggt 1080
ctgtgcaggc agcagcaaag gggctgacac gtgccagggc gattctggag gccccctggt 1140
gtgtgatggt gcactccagg gcatcacatc ctggggctca gacccctgtg ggaggtccga 1200
caaacctggc gtctatacca acatctgccg ctacctggac tggatcaaga agatcatagg 1260
cagcaagggc tgattctagg ataagcacta gatctccctt aataaactca caactctcaa 1320
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaa 1377



Homo sapiens RTN2-A (RTN2) mRNA, complete cds.



/ trans lation= " MGQVLPWAHCKEAPSTASSTPDSTEGGNDDSDFRELHTAREFSE 
EDEEETTSQDWGTPRELTFSYIAFDGWGSGGRRDSTARRPRPQGRSVSEPRDQHPQPS 
LGDSLESIPSLSQSPEPGRRGDPDTAPPSERPLEDLRLRl»DHIiGWVARGTGSGEDSSTS 
SSTPLEDEEPQEPNRLETGEAGEEIiDIiRIiRIiAQPSSPEVLTPQLSPGSGTPQAGTPSPS 
RSRDSNSGPEEPLLEEEEKQWGPLEIlEPVRGQCIiDSTDQIiEFlVEPRljIjGTAMEWIjKTS 

IiLIAVYKTVPILELSPPLWTM^ 

MGS KVADLL YWKDTRT S GWFTGLMVS LLCLLHF S IVS VAAHI1ALX1LLCGT I S LRVYRK 
YLQ AVHRGD GANP F Q AYLDVDLTLTREQTERL SHQ ITS RWS AATQLRHF FLVEDLVD S 
LKLALLFYILTFVGAI FNGLTLIilLGVIGLFTI PLIiYRQHQAQIDQYVGIiVTNQLSHIK 
AKIRAKI PGTGALAS AAAAVSGSKAKAE » 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



cccgggagga ggaggcggcg agaatggcag cggcgtcgtg ggcgcggcgg agatgagcgc 
ccgcgacccc gggcccaggg cggcacagcc ggagtgggcg ggggtcccga tgcaggcccg 
aggggggcca tggggcaggt cctgccggtc ttcgcccact gcaaagaagc tccgtctaca 
gcctcctcaa ctcctgattc cacagaagga gggaacgacg actctgattt tcgagagctg 
cacacagccc gggaattctc agaggaggac gaggaggaga ccacgtcgca ggactggggc 
accccccggg agctgacctt ctcctacatc gcctttgatg gtgtagtggg ctccgggggc 
cgcagggatt caactgcccg ccgcccccgc ccccagggcc gctcagtctc ggaaccacga 
gaccagcacc ctcagcccag cctgggcgac agcttggaga gcatccccag cctgagccaa 
tccccggagc ctggacgacg gggtgatcct gacaccgcgc ctccatccga gcgccctctg 
gaagacctga ggcttcggtt ggaccatctg ggctgggtgg cccggggaac gggatccggg 
gaggactctt ccaccagcag ctccaccccg ctggaagacg aagaacccca agaacccaac 
agattggaga caggagaagc tggggaagaa ctggacctac gactccgact tgctcagccc 
tcatcgcccg aggtcttgac tccccagctc agtccgggct ctgggacacc ccaggccggt 
actccgtccc catcccgatc gcgagattcg aactctgggc ccgaagagcc attgctggaa 
gaggaagaaa agcagtgggg gccactggag cgagagccag taaggggaca gtgcctcgat 
agcacggacc aattagaatt cacggtggag ccacgccttc taggaacagc tatggaatgg 960 
ttaaagacat cattgctttt ggctgtttac aagacggttc caattttgga attgtcccca 1020 
cctctgtgga cagccattgg ctgggtccaa aggggcccca ccccccctac tcctgtcctc 
cgggttctac tgaagtgggc aaaatccccg agaagcagcg gtgtccccag cctctcactc 
ggagccgata tggggagtaa agtggcggac ctgctgtact ggaaggacac gaggacgtca 
ggagtggtct tcacaggcct gatggtctcc ctcctctgcc tcctgcactt tagcatcgtg 
tccgtggccg cgcacttggc tctgttgctg ctctgcggca ccatctctct cagggtttac 
cgcaaagtgc tgcaggccgt gcaccggggg gatggagcca accctttcca ggcctacctg 
gatgtggacc tcaccctgac tcgggagcag acggaacgtt tgtcccacca gatcacctcc 
cgcgtggtct cggcggccac gcagctgcgg cacttcttcc tggtagaaga cctcgtggat 
tccctcaagc tggccctcct cttctacatc ttgaccttcg tgggtgccat cttcaatggt 
ttgactcttc tcattctggg agtgattggt ctattcacca tccccctgct gtaccggcag 1620 
caccaggctc agatcgacca atatgtgggg ttggtgacca atcagttgag ccacatcaaa 1680
gctaagatcc gagctaaaat cccagggacc ggagccctgg cctctgcagc agccgcagtc 1740
tccggatcca aagccaaagc cgaatgagaa cggtgtctct gcccgcagga cgcctgcccc 1800
cagcccccgc agccctctgg ccccctccat ctcttgtccg ttcccaccca cccccctcct 1860
cggcccgagc cttttcccgg tgggtgtcag gatcactccc actagggact ctgcgctaat 
tacctgagcg accaggacta catttcccaa gaggctctgc tccaggagtc caggaaagac 
gaggcacctt ggccgcgggg cctgctggga cttgtagttg cctagacagg gcaccaccct 
gcacttccgg acccgccgct ggaggcgccg tgaggcgttg gtgtctcctg gatgctacta 
gccccaacgc cggggctttg catggggccc aggggaggcc tgagcttgga tttacactgt 
aataaagact cctgtggaaa aaaaaaaaaa 









DE Human mRNA for KIAA0188 gene, partial cds.



/translation="HARRRSVQTMNYVGQLAGQVFVTVKELYKGLNPATLSGCIDIIVI
RQPNGNLQCSPFHVRFGKMGVLRSREKVVDIEINGESVDLHMKLGDNGEAFFVQETDND
QEVIPMHLATSPILSEGASRMECQLKRGSVDRMRGLDPSTPAQVIAPSETPSSSSVVKK
RRKRRRKSQLDSLKRDDNMNTSEDEDMFPIEMSSDEAMELLESSRTLPNDIPPFQDDIP
EENLSLAVIYPQSASYPNSDREWSPTPSPSGSRPSTPKSDSELVSKSTERTGQKNPEML
WLWGELPQAAKSSSPHKMKESSPLSSRKICDKSHFQAIHSESSDTFSDQSPTLVGGALL
^^QTEMQFVNEEDLETLGAAAPLLPMIEELKPPSASWQTANKTDSPSRKRDKRSR
HLGADGVYLDDLTDMDPEVAALYFPKNGDPSGLAKHASDNGARSANQSPQSVGSSGVDS
GVESTSDGLRDLPSIAISLCGGLSDHREITKDAFLEQAVSYQQFVDNPAIIDDPNLVVK
IGSKYYNWTTAAPLLLAMQAFQKPLPKATVESIMRDKMPKKGGRWWFSWRGRNTTIKEE
SKPEQCIAGKAHSTGEQPPQLSLATRVKHESSSSDEEPAAAKPSNAGHLPLLPNVSYKK
TLRLTSEQLKSLKLKNGPNDVVFSVTTQYQGTCRCEGTIYLWNWDDKVIISDIDGTITR
SDTLGHILPTLGKDWTHQGIAKLYHKVSQNGYKFLYCSARAIGMADMTRGYLHWVNERG
TVLPQGPLLLSPSSLFSALHREVIEKKPEKFKVQCLTDIKNLFFPNTEPFYAAFGNRPA
DVYSYKQVGVSLNRIFTVNPKGELVQEHAKTNISSYVRLCEVVDHVFPLLKRSHSSDFP
CSDTFSNFTFWREPLPPFENQDIHSASA"

SJSSSS Cgccgctcgg tgcagaccat gaattacgtg gggcagttag ccggccaggt 60 

gtttgtcacc gtgaaggagc tctacaaggg gctgaatccc gccacactct cagggtgcat 120

tgacatcatt gtcatccgcc agcccaatgg aaacctccaa tgctcccctt tccacgtccg 180 

atgg ? ggt * c tgcgctcccg agagaaagtg gttgacatag aaatcaatgg 24 0 

ggaatctgtg gatttgcata tgaaattggg agataatgga gaagcatttt ttgttcaaga 300

aacagataat gatcaggaag ttatccctat gcacctggcc acctccccca tcctgtcaga 360 

lllltrnlln agaatggaat ^cagctgaa aaggggctct gtggacagga tgagaggcct 42 0 

llttllltll a = 9CCagCCC ^agtgatcgc tcccagcgag acgccgtcaa gcagclctgt 480 

agtaaagaag agaagaaaaa ggaggagaaa gtcacagctg gacagcctga agagagatga 540 

ca^aa^ a * atCt 9 a " -tgaggacat gttccccatc gagafgagct cggatglgic Too 

catggagctg ctggagagca gcagaactct tcctaatgat atacctccat tccaagatga 660 

tattcctgag gaaaacctct ccctggctgt gatttaccct cagtcagcct cataccctaa 720

ttcggataga gagtggtcac ccactcccag tccttccggt tcccgacctt caacacctaa 780 

£c?3££S gaattggtca 9caagtccac ggaaaggaca gggcagaaga acccagaaat 840 

gctttggctg tggggagagc tgccgcaggc tgctaagtct tcttctccac acaagatgaa 900 

agagtccagc ccattgagca gtagaaaaat ttgtgataaa agtcactttc aggccattca 960 

cagcgaatct tcagacactt ttagtgacca atcgccaact ctggtcggtg gggcactttt 1020 

llllatlllt aagcctcaga <=agaaa t gca gtttgtgaat gaagaagacf Jggagacctt loll 

aggagcagca gcgccactct tgcccatgat cgaggagctc aaacccccct c?gccagtgt 1140 

Staafoo 3 ? Caaacaa9a c 39^ttctcc ttccaggaaa agagataaac gaagccgaca 1200 

tcttggtgct gacggcgtct acttggatga cctcacagac atggatcctg aagtggcggc 1260 

! cccaaaaacg gagatccttc cggactcgca aaacatgcal gcgaSaclg \Ho 

Illicit, gcCaacca9t ccccgcagtc ggtgggcagc tcgggcgtgg acagtggcgl 13 80 

ggagagcacc tcggacgggc tgagggacct cccttccatc gccatctccc tctgcggggg 1440 

T^lllT CaCC99gaga tc ^^3aaaga tgcattcctg gagcaagctg tgtLS?ca 1500 

acagtttgtg gacaaccccg ctattatcga tgaccccaat ctcgtggtaa agattgggag 1560 

taaatattat aactggacaa cagcagcacc cctcctcctg gcaatgcagg ccttcSglf \l 2 l 

acctttgcca aaggccactg tggaatctat catgagggat aaaatgccca aaaagggagg 1680 

aagatggtgg ttttcatgga ggggaagaaa caccacaatc aaggaggaaa gtaagSaga lllo 

!™S£S S ct 99caagg cccatagcac cggagagcaa ccgccgcagc tcagcttggc 1800 

S aagcatgaat catcctccag tgatgaggag cgcgoagctg ccaagcca?c i860 

aaacgcaggc cacctccctc ttctgcctaa tgtcagctac aagaagactc tccggctgac 1920 

ttccgagcag cttaaaagct tgaagttgaa gaatggcccc aacgacgtgg ttttcagtgt 1980 

caccacgcag taccaaggca cgtgccgctg tgagggcacc atctatctgt ggaactggga 2040 

tgataaagtc atcatttctg atattgatgg gacaattacc agatcagaL clcttggKa lloo 

cattttgccc acccttggga aggattggac ccatcagggc atcgctaagc tgtaccataa 2160 

a lS a r° a9 aatggatata aatttctcta ctgttctgcc cgtgccatcg ggatggcgga 2220 

catgacgcgg ggctacctgc actgggtcaa cgagaggggc acggtgctgc cccaggggcc 2280 



cctgctgctg agtcccagca gcctcttctc tgccctgcac agagaagtga ttgaaaagaa 
gccagaaaag tttaaagtcc agtgtttgac agacatcaaa aacctgtttt tccccaacac 
agaacccttt tatgctgctt ttggaaaccg accagctgat gtgtattcat acaagcaagt 
aggagtgtct ttgaatagaa tatttaccgt caaccctaaa ggagagctgg tacaggaaca 
tgcaaagacc aacatctctt cgtatgtgag actctgtgaa gtagtcgacc acgttttccc 
gttgctgaaa agaagccatt cttcagactt tccctgttcg gataccttca gtaacttcac 
cttttggaga gagccactgc caccttttga aaaccaggac attcattctg cctcagcgta 
aaatgtccca agcagcctct tgccagcagt gcagagcctg gttgtcaccc attaaaggat 
aggtctcccc ggagtgcaca gctccacctg ggagcctggc gcgtcatcat tggcctgaca 
gcagagagaa ttgagaagca tttctcccct gccccacccc ggggctgaca tttctaagca 
agataggaag ggagcacttt ctaggctagg agttgggtgc atttgtaccg tgaaaagcat 
tcctcagttg tggcttaatg ccagttacga cgctgccttt ccggcctgct ccagcaagta 
gctactggtt cacgtgcagt ttggggctgt gaaacctagg cagaaggcgg ctgtctgagg 
gctgtccccg cctaggacag ggtcaatcga ggaatgccag atgtgcacgg tttttggcaa 
agtagggggc acatttccat tatagcaatg ttagtgccac caccttctga acacagtggg 
gagggctgtg aaggctcatg tgacctggat ctgaggtctc tgatagaaat ctggacgcca 
ccgggtccag gcctggcctc agacttggcc ttgtggatgg gccccttaca gtatttgctg 
actagtctca tttttaggtg ataagttttt ctttaattcc tttggttaaa gatagtctat 
ttcattggca tatctccccc cagtttttgt ggctcaaggc tggaatattt atgccttaat 
atatctatgg cagacattta agaatgcgct ttatctagct catggtaact ttgcaacgcc 
ttagattaaa atgacagtaa atattactaa ggcagtattt tgaatgagtt tgacactgcc 
ggcttccttc catccagcga ggtggtgctg acagtgtgga cttgagcaca cttatgccaa 
atgataatga tactgacttc tgttgggagc tctccaaaga aactggttgg ttttaagaaa 
atagtttcaa gaagttcaac tatattcttt tagatattat gtattgtttt actctgatta 
ggttactgtg ataggcattt attcatattc tttctatacc actgtcatta atatattaaa 
aagatgtatg tgttagacta tcgaaagggc cttattctct ctttctcata gactgacctt 
cttttggaat ttctgagtca tttattttcc ttagcttttt ccactcaaat taagggcaag 
cgaaaaagta ataatttggc attctttaag cctacagaat gtgattcttt cacttgttta 
ttacactggc tcgtggacag aacaatttga aaagtgaaag aattattttg gtaaaagatt 
ttgctttact tttcgaagca ttattttttt aaagagtgtt ttactccaac gattgaaaca 
ttttcctatt taaatttcat cgttagaatc acaggaggca aaaaatggaa cggttgaatg 
aaattttact ctttctgtga aagaaaatcc acagagttgt tgcctccgtt gtagttggtg 
ggccccgtta gcattggatg cctttgccaa atggttcatg tggacacaca aaggcaaaca 
gatctgccat cgatcgcaga tttctgtaga aacacggatg tgcatgtgca gattcccttt 
tgcaggtatt aaaaataatt aaaaatagtc ctgcctgagg ttgcagtgag ccgagcttgc 
actactgcac tccagcctgg gtgacagagt aagactccat gtcaaaaaaa aaaaaaaaaa 
aaaaaagtcc tgccttaact aactcctctg cgcttgttca ctagtaacct aaagaggcta 
tattcattct ttatgcaatg agggtatttt tgagtgaatt ttaactgctc tgaactaagt 
ataagctcat gggcctgcaa aggttcagac ggtttctcct ttgcacccag gaggaacttt 
ggctgcgaga atggggggat gtatccctca tgcagttggc atccaggcag ccctctgcag 
cagcacaccc tgcaggcgga gttttcagag gatgcaattt tggatcccga attttgatgt 
accttaaact tccacatcac tgcaccctga aacagagcat gctttccaga aagtcacact 
ctcagatctg tgtcaagttc aatgtgagcc ctggcaaggc tggcatatta acacctgcct 
tctggcttct gaaagtgaga tttgtatatg ggctgcactc acgcatatac gagttggttt 
atctttgtgt acatgactat aacccagtga tgctgaggtc atgtgctgga atgctgtatt 
tggaccacac atttcaaagt tgccctatgg aaatgaatcc tacttagtga caagtcatca 
aatgtttgtc acatgtgatg aagacaaata tgtatacctg gcatagagaa aaatatatac 
ctggtacatt ggagaaaaat aattacactt tcaaagagaa ttccctttgc aattttatgt 
ttggatcacc actgtaagca cactttattt gcatttgatc tgtatttgta tatgctgatg 
caatgataaa aatcactgta atacttcatt gtgttgtact ggatgcaaag ctagaaaata 
ttgcaataaa tgagaccgat gaaagac 






DE Homo sapiens 3-hydroxy-3-methylglutaryl-Coenzyme A synthase 1 (soluble),
DE mRNA (cDNA clone IMAGE:2819708), partial cds.



/translation="HSLSSAAARSRLCPKEETVTDLETAVLYPSHSSFTMPGSLPLNAE
ACWPKDVGIVALEIYFPSQYVDQAELEKYDGVDAGKYTIGLGQAKMGFCTDREDINSLC
MTVVQNLMERNNLSYDCIGRLEVGTETIIDKSKSVKTNLMQLFEESGNTDIEGIDTTNA
CYGGTAAVFNAVNWIESSSWDGRYALVVAGDIAVYATGNARPTGGVGAVALLIGPNAPL
IFERGLRGTHMQHAYDFYKPDMLSEYPIVDGKLSIQCYLSALDRCYSVYCKKIHAQWQK
EGNDKDFTLNDFGFMIFHSPYCKLVQKSLARMLLNDFLNDQNRDKNSIYSGLEAFGDVK
LEDTYFDRDVEKAFMKASSELFSQKTKASLLVSNQNGNMYTSSVYGSLASVLAQYSPQQ
LAGKRIGVFSYGSGLAATLYSLKVTQDATPGSALDKITASLCDLKSRLDSRTGVAPDVF
AENMKLREDTHHLVNYIPQGSIDSLFEGTWYLVRVDEKHRRTYARRPTPNDDTLDEGVG
LVHSNIATEHIPSPAKKVPRLPATAAEPEAAVISNGEH"



cactcccttt cctctgctgc cgctcggtca cgcttgtgcc cgaaggagga aacagtgaca 60 

gacctggaga ctgcagttct ctatccttca cacagctctt tcaccatgcc tggatcactt 120 

cctttgaatg cagaagcttg ctggccaaaa gatgtgggaa ttgttgccct tgagatctat 180 

tttccttctc aatatgttga tcaagcagag ttggaaaaat atgatggtgt agatgctgga 240 

aagtatacca ttggcttggg ccaggccaag atgggcttct gcacagatag agaagatatt 300

aactctcttt gcatgactgt ggttcagaat cttatggaga gaaataacct ttcctatgat 360 

tgcattgggc ggctggaagt tggaacagag acaatcatcg acaaatcaaa gtctgtgaag 420 

actaatttga tgcagctgtt tgaagagtct gggaatacag atatagaagg aatcgacaca 480

actaatgcat gctatggagg cacagctgct gtcttcaatg ctgttaactg gattgagtcc 540 

agctcttggg atggacggta tgccctggta gttgcaggag atattgctgt atatgccaca 600 

ggaaatgcta gacctacagg tggagttgga gcagtagctc tgctaattgg gccaaatgct 660 

cctttaattt ttgaacgagg gcttcgtggg acacatatgc aacatgccta tgatttttac 720 

aagcctgata tgctatctga atatcctata gtagatggaa aactctccat acagtgctac 780 

ctcagtgcat tagaccgctg ctactctgtc tactgcaaaa agatccatgc ccagtggcag 840 

aaagagggaa atgataaaga ttttaccttg aatgattttg gcttcatgat ctttcactca 900 

ccatattgta aactggttca gaaatctcta gctcggatgt tgctgaatga cttccttaat 960 

gaccagaata gagataaaaa tagtatctat agtggcctgg aagcctttgg ggatgttaaa 1020 

ttagaagaca cctactttga tagagatgtg gagaaggcat ttatgaaggc tagctctgaa 1080 

ctcttcagtc agaaaacaaa ggcatcttta cttgtatcaa atcaaaatgg aaatatgtac 1140 

acatcttcag tatatggttc ccttgcatct gttctagcac agtactcacc tcagcaatta 1200 

gcagggaaga gaattggagt gttttcttat ggttctggtt tggctgccac tctgtactct 1260

cttaaagtca cacaagatgc tacaccgggg tctgctcttg ataaaataac agcaagttta 1320 

tgtgatctta aatcaaggct tgattcaaga actggtgtgg caccagatgt cttcgctgaa 1380

aacatgaagc tcagagagga cacccatcat ttggtcaact atattcccca gggttcaata 1440

gattcactct ttgaaggaac gtggtactta gttagggtgg atgaaaagca cagaagaact 1500 

tacgctcggc gtcccactcc aaatgatgac actttggatg aaggagtagg acttgtgcat 1560 

tcaaacatag caactgagca tattccaagc cctgccaaga aagtaccaag actccctgcc 1620 

acagcagcag aacctgaagc agctgtcatt agtaatgggg aacattaaga tactctgtga 1680 

ggtgcaagac ttcagggtgg ggtgggcatg gggtgggggt atgggaacag ttggaggaat 1740 

gggatatctg gggataattt taaaggatta catgttatgt aaatttttat gtgactgaca 1800 

tggagcctgg atgactatcg tgtacttggg aaagtctctt tgctctattt gctgacatgc 1860 

ttcctgttgt ggtctggcca atgccaaatg tactcgaatg atgttaaggg ctctgtaaaa 1920 

cttcatacct ctttggccat ttgtatgcat gatgtttggt ttttaaacat ggtataatga 1980 

attgtgtact tctgtcagaa gaaagcagag gtactaatct ccaattaaaa aattttttaa 2040 

catgtaaaaa aaaaaaaaaa aaaaaaaa 2068



Homo sapiens S100 calcium binding protein A14, mRNA (cDNA clone MGC:11012
IMAGE:3640899), complete cds.

/translation="MGQCRSANAEDAQEFSDVERAIETLIKNFHQYSVEGGKETLTPSE
LRDLVTQQLPHLMPSNCGLEEKIANLGSCNDSKLEFRSFWELIGEAAKSVKLERPVRGH"

agatcatgag ccatcagctc ctctggggcc agctatagga caacagaact ctcaccaaag 
caccagacac agtgggcacc atgggacagt gtcggtcagc caacgcagag gatgctcagg 
aattXgtga tgtgglgagg gccattgaga ccctcatcaa gaactttcac cagtactccg 
tggagggtgg gaaglagaci ctgacccctt ctgagctacg ggacctggtc acccagcagc 
tgccccatct catgccgagc aactgtggcc tggaagagaa aattgccaac ctgggcagct 
gcaatgactc taaactggag ttcaggagtt tctgggagct gattggagaa ^ggccaaga 360 
itgtgaagct ggagaggcct gtccgggggc actgagaact ccctctggaa ttcttggggg 420 
Itg^gggga glgaclgtgg gcctggaaat aaaacttgtc tcctctacaa aaaaaaaaaa 480 
aaaaaaaaa 



DE Homo sapiens cDNA clone:ADBALE09, 5'end, expressed in human adrenal gland.



aaaatatcat ggattgaacc tcatcaattg atagcagtga gtgactgaag cttccaaatc 60
aagaaaagcc ggcaccaaga acttccattc taatctagag ctgaccagtt tgagctgatt 120
ctctctttga agagtccttc ttgattgcag tgcagtactg gcatttctga atggatgtaa 180
gtggagtatt ttagtctaaa ggcttttcaa attacttgaa tttttttaaa aattgaggag 240
ctttatttct atttaccctt ccatttttgt atatcaaatt tccattgtca ttaaaaactg 300
tatcttgaaa ctttgtgaac tgacttgctg tatttgcact ttgagctctt gaaataaatg 360
tgatttttgt gtgattaaaa caaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 420
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aactcgctcg ggccgaattg ggcacgagcc 480
acccaccacc tttggcacag cccctttgtt tttacaccaa taccaagaat taagggggaa 540
gccttggcag ttttcacgtt taaaccagac tcctttgccg gaacccaacc cgncaccctg 600
ctggcctccg tc 612



as43b01.x1 Barstead aorta HPLRB6 Homo sapiens cDNA clone IMAGE:23X9913
mRNA sequence.



tttaaaaaac aaactgcaaa atggtattta tttacattaa aacatgaatt gcctgtatac 60
acacaaatat aagaggaaca atctgttatg cacaataact gtaatattta gtacatgtta 120
tacacagcag tatctgttaa gtcagtggtt tgagtgaaaa cacagtacca aaacattcct 180
gatacaaaat aagttactca ttcacatatt ctaatcatac aagacactta atattttaaa 240
agttacatac ttcaaataac actggctaaa tgtacaacta aagtttatta atttttttta 300
tgaaaagact tcagattgtt attcataaat gatccctttc aggatgcatt atcttttaaa 360
taaataaact aaattgactt caagactatt tataaatagc ccactaaaat atgattgaag 420
acattccttc attttattaa ggtgtagcta tatactagag aatatgctca actactgcct 480
ccaaatccaa cactgtcatt ctaattgcaa atagaattta ttaaattcca cttcaggaca 540
tgagatgagc tgcctgccct attttgtcaa tggttccaaa gcattaacgg attaagagac 600
tgc 603



DE Homo sapiens drebrin 1, transcript variant 1, mRNA (cDNA clone MGC:1517
DE IMAGE:3356428), complete cds.






/translation="MAGVSFSGHRLELLAAYEEVIREESAADWALYTYEDGSDDLKLAA
SGEGGLQELSGHFENQKVMYGFCSVKDSQAALPKYVLINWVGEDVPDARKCACASHVAK
VAEFFQGVDVIVNASSVEDIDAGAIGQRLSNGLARLSSPVLHRLRLREDENAEPVGTTY
QKTDAAVEMKRINREQFWEQAKKEEELRKEEERKKALDERLRFEQERMEQERQEQEERE
RRYREREQQIEEHRRKQQTLEAEEAKRRLKEQSIFGDHRDEEEETHMKKSESEVEEAAA
IIAQRPDNPREFFKQQERVASASAGSCDVPSPFNHRPGSHLDSHRRMAPTPIPTRSPSD
SSTASTPVAEQIERALDEVTSSQPPPLPPPPPPAQETQEPSPILDSEETRAAAPQAWAG
PMEEPPQAQAPPRGPGSPAEDLMFMESAEQAVLAAPVEPATADATEVHDAADTIETDTA
TADTTVANNVPPAATSLIDLWPGNGEGASTLQGEPRAPTPPSGTEVTLAEVPLLDEVAP
EPLLPAGEGCATLLNFDELPEPPATFCDPEEVEGEPLAAPQTPTLPSALEELEQEQEPE
PHLLTNGETTQKEGTQASEGYFSQSQEEEFAQSEELCAKAPPPVFYNKPPEIDITCWDA
DPVPEEEEGFEGGD"






ccgaggcggc ggcggcgact ccctctttcc ctccctcctc ctccgtccgc ccgtccgtcc 
gcgcgtctgt ccgttcggcc cggtccggcc cgaagcatgg ccggcgtcag cttcagcggc 
caccgcctgg agctgctggc ggcttacgag gaggtgatcc gagaggagag cgcggccgac 

tgggctctgt acacatatga agatggctcc gatgacctca agcttgcagc atcaggagaa 240

gggggcttgc aggagctttc gggacacttt gagaaccaga aggtgatgta cggcttctgc 300 

agtgtcaagg actcccaagc tgctctgcca aaatacgtgc tcatcaactg ggtgggcgaa 360

gatgtgcctg atgcccgcaa gtgcgcttgt gccagccacg tggctaaggt ggcagagttc 420 

ttccagggtg tcgacgtgat cgtgaacgcc agcagcgtgg aagacataga cgcgggtgcc 480 

atcgggcagc ggctctctaa cgggctggcg cgactctcca gccctgtgct gcaccgactg 540 

cggctgcgag aggatgagaa cgcagagccc gtgggcacca cctaccagaa gacggatgca 600 

gctgtggaaa tgaagcggat taaccgagag cagttctggg agcaggccaa gaaggaagaa 660 

gagctgcgga aggaggagga gcggaagaag gccctggatg agaggctcag gttcgagcag 72 0 

gagcggatgg agcaggagcg gcaggagcaa gaggagcgcg agcggcgcta ccgggagcgg 78 0 

gagcagcaga tcgaggagca caggaggaaa cagcagactt tagaagcgga agaggccaag 840 

aggcggttga aggagcagtc tatctttggt gaccatcggg atgaggagga agagacccac 900 

atgaagaagt cagagtcgga ggtggaggag gcagcagcta ttattgccca gcggcctgac 960 

aacccaaggg agttcttcaa gcagcaggaa agagtcgcat cggcctctgc gggcagctgt 1020 

gatgtaccct cgcccttcaa ccatcgacca ggcagccacc tggacagcca ccggaggatg 1080

gcgcccactc ccatccccac gcggagcccg tctgactcca gcaccgcctc cacccctgtc 1140

gctgagcaga tagagcgggc cctggatgag gtcacctcct cgcagcctcc accactgcca 12 00 

ccgccacccc caccagccca agagacccag gagcccagcc ccatcctaga cagtgaggag 12 60 

accagagcag cagcccctca ggcctgggcc ggccccatgg aggagccccc tcaggcacag 13 20 

f!^^^ CC ? gsg ? cca 99 cagccctgca gaggacttga tgttcatgga gtctgcagag 1380 

caggctgtcc tggctgctcc cgtggagcct gccacagctg acgccacgga ggtccacgat 14 40 

gcagctgaca ccattgaaac tgacactgcc actgctgaca ccactgttgc caacaacgta 15 00 

ccccccgccg ccaccagcct cattgaccta tggcctggca acggggaagg ggcctccaca 1560 

ctccagggtg agcccagggc ccccacgcca ccctcgggta ctgaggtcac cctggcagag 1620 

?S!^ C ^ 9C tg 9 at 9 a 99 t ggctccggag ccactgctgc cagcaggcga aggctgtgcc 1680 

acccttctca actttgatga gctgcctgag ccgccagcca ccttctgtga cccagaggaa 1740 

gtggaagggg agcccctggc tgccccccag accccaactc tgccctcagc ccttgaggag 1800 

ctggagcaag agcaggagcc ggagccccac ctgctaacca atggcgagac cacccagaag 1860

gaggggaccc aggccagtga ggggtacttc agtcaatcac aggaggagga gtttgcccaa 192 0 

tcggaagagc tctgtgccaa ggctccgcct cctgtgttct acaacaagcc tccagagatc 1980 

gacatcacat gctgggatgc agacccagtt ccagaagagg aggagggctt cgagggtggt 204 0 

gattagcggt ggcgccagcc ctaggctacc cttgccaagg ccgcccacct gcatcagcct 2100

ctggccagac ggcccgccgt gcctgcattc gcagcagctc cgcctggcac ccactccgga 2160 

ttccggccct ggctggggac ttggccgctt ccctacccac agggcctgac ttttacagct 2220 

tttctctttt tttaaaaagt tgataggaga cttgtacagt tgactggctt tcctctcgtt 2280 

ggtagttgag acgctgttgc aaattccacc cctccttccc tggtccagat tgtagctctt 2340 

agtcctccct gctcagctgg ccgggttgga ggcctcaccc tgcttggggc ctggcgtggg 2400 

gggagctctg gtgggaaaat gtcccccacc tcttttccta gttttatgtt tcttgggaaa 2460 



atatcacttt gtattctctg tccagggctt cagatatttt gcacgaattt taaaacatgg 2520
caataaatgg ctcgtgggct ctggcaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2580
aaaaaaaaaa aaa 2593



DE Homo sapiens potentially prenylated protein tyrosine phosphatase hPRL-3
DE mRNA, complete cds.

/translation="MARMNRPAPVEVSYKHMRFLITHNPTNATLSTFIEDLKKYGATTV
VRVCEVTYDKTPLEKDGITVVDWPFDDGAPPPGK^A^DVttSLVKAKFCEAPGSCVAVHC 

VAGLGRAPVLVALALIESGMKYEDAIQFIRQKRRGRINSKQLTYLEKYRPKQRLRFKDP
HTHKTRCCVM"






aagagttggg ttttcttttt taattatoca aacagtgggc agcttcctcc cccacaccca 

agtatttgca caatatttgt gcggggtatg ggggtgggtt tttaaatctc gtttctcttg 120

llotaiZll SSTJ 96 tCtCCtcatt "ttgggggt gtgtggggac ttctcaggt? iso 

gtgtccccag ccttctctgc agtcccttct gccctgccgg gcccgtcggg aggcgccatg 240

gctcggatga accgcccggc cccggtggag gtgagctaca aacacatgcg cficltcatc 300 

acccacaacc ccaccaacgc cacgctcagc accttcattg aggacctgaa gaagtacggg 360

gctaccactg tggtgcgtgt gtgtgaagtg acctatgaca aaacgccgct ggaSaggS 420 

ggcatcaccg ttgtggactg gccgtttgac gatggggcgo cccogcctgg SaggtStg tlo 

SSJ^S 90 ! 9agCCt99t ^SSCc a& 9 ttctgtgagg cccccggcag ctgcgtggc? 540 

9 tg9C999CCt 9g 99 c 999 ct ccagtccttg tggcgctggc gcttatlgag 600 

agcgggatga agtacgagga cgccatccag ttcatccgcc agaagcgccg cggacgcatc 660 

aacagcaagc agctcaccta cctggagaaa taccggccca aacagaggc? gcggtEcaaa 72S 

gacccacaca cgcacaagac ccggtgctgc gttatgtagc tcaggacctt ggctgggcct 780 

ggtcgtcatg taggtcagga ccttggctgg acctggaggc cctgccagcc ctgctctgcc 840 

SSSSKr 999 r CCa " cct ^9ot 99 ccccacatcg ccttttcctc cccgacacct 900 

tS?«£2 S?! C f 9a ? g a ^ a ^9C ccctcggcgc cttgggtggc ttctgggccc 960 

tttctcctgt ctccgtactc cctctggcgg cgctggcgtg gctctg 1006 



Homo sapiens cell cycle progression restoration 8 protein (CPR8) mRNA, 
complete cds. 

/translation="MLKRELERERLVTTALRGELQQLSGSQLHGKSDSPNVYTEKKEIA
ILRERLTELERKLTFEQQRSDLWERLYVEAKDQNGKQGTDGKKKGGRGSHRVKNKSKGT
FLGSVKETFDAMKNSTKEFVRHHKEKIKQAKEDVKENLKKFSDSVKSTFRHFKDTTKNI
FDEKGNKRFNATKEAAEKPRTVFSDYLHPQYKAPTENHSRPYYAKRWKEEKPVHFKEFR
KNTNSKKCSPGHDCRENSHSFRKACSGVFDCAQQESMSLFNTVVIPIRMDEFRQIIQRY
MLKELDTFCRWNELDQFINKFFLNGVFIHDQKLFTDFVNDVKIILGNMKEYEVDNDGVF
EKLDEYIYRHFFGHTFSPPYGPRSVYIKPCHYSSL"

gaattcgcaa agatgctaaa gagagaactg gagagagaac gactagtaac tacggcttta 
aggggggaac tccagcagtt aagtggtagt cagttacatg gcaagtcaga ttctcccaat 
gtatatactg aaaaaaagga aatagcaatc ttacgggaaa gactcactga gctggaacgg 
aagctaacct tcgaacagca gcgttctgat ttgtgggaaa gattgtatgt tgaggcaaaa 
gatcaaaatg gaaaacaagg aacagatgga aaaaagaaag ggggcagagg aagccacagg 
gttaaaaata agtcaaaggg aacatttttg ggttcagtta aggaaacatt tgatgccatg 
aagaattcta ccaaggagtt tgtaaggcat cataaagaga aaattaagca ggctaaagaa 
gatgtgaagg aaaatctgaa aaaattctca gattcagtta aatccacttt cagacacttt 
aaagatacca ccaagaatat ctttgatgaa aagggtaata aaagatttaa tgctacaaaa 
gaagcagctg aaaaaccaag aacagttttt agtgactatt tacatccaca gtataaggca 
cctacagaaa accattcaag gccctactat gcaaaaagat ggaaggaaga aaagccagtt 
cactttaaag aattcagaaa aaatacaaat tcaaagaaat gcagtcctgg gcatgattgt 
agagaaaatt ctcattcttt cagaaaggct tgttctggtg tatttgattg tgctcaacaa
gagtccatga gcctttttaa cacagtggtg atccctataa ggatggatga atttagacag 
ataattcaaa ggtacatgtt aaaagaactg gatacttttt gtcgctggaa cgaacttgat 
cagttcatca ataagttttt cctaaacggt gtctttatac atgatcagaa gctcttcact 
gactttgtta atgatgttaa gattatctta ggaaacatga aggaatatga agtagataat 
gatggagtat ttgagaagtt ggatgaatat atatatagac acttctttgg tcacactttt 
tcccctccat atggacccag gtcggtttac ataaaaccgt gtcattacag tagtttgtaa 
catttgtaga ttggatacga tttttatgat ttgatgagtt tcttgtaagg ttaccgtttc 
taagagttgt gctttatggc cactgagaga attcagaata aattgaaaga tggagtctaa 
aaattattag ctgttacaaa tggaacaatt tcattataac gtgatcactt tgacttgagc 
aaatggttta atttttatct taaaatcagt taagaatata taaaatccta cctttggcca 
agtttgtttc ttttcattat agtttatatg aaaagatcac cttaagtgaa attattttcc 
ttattttcct ttaatctttt atgtatttat tcacttctgg aagctaggaa tgagcaacac 
aaattttact ctgaagtcag aagagctcat atatataatt ctaatgtccc acctatgtcc 
attccatgta ccagcttagt tatatactag tcacataatt atctttgata aaggtagagg 
cacaaagagg caaactaaca agtcaaattc taatgtgtgt acttcataat aattttttat 
ccattttcat cttctttatc tttatattct gtaacatgaa acttacctaa tcttcaaatg 
ttagcttcat tttttacctt tgaaatactt aatctttctg aataaatata atggtctata 
aaaaaaaaaa aaaaaaaaaa aaaaaaaccg tcgaaaagcg gccgccaccg cgtgga 






Human channel-like integral membrane protein (CHIP28) mRNA, complete cds. 
channel-like integral membrane protein.



/translation="MASEFKKKLPWRAVVAEFLATTLFVFISIGSALGFKYPVGNNQTA
VQDNVKVSLAFGLSIATIAQSVGHISGAHLNPAVTLGLLLSCQISIFRALMYIIAQCVG
AIVATAILSGITSSI/TGNSLGRNDI^GVNSGQG^

LGGSAPLAIGLSVALGHLLAIDYTGCGINPARSFGSAVITHNFSNHWIFWVGPF
AVLIYDFILAPRSSDLTDRVKVWTSGQVEEYDLDADDINSRVEMKPK"



gcagtggagg gggcaagctt 



ggccagcgag 


ttcaagaaga 


60 


gaccctcttt 


gtcttcatca 


120 


caaccagacg 


gcggtccagg 


180 


cacgctggcg 


cagagtgtgg 


240 


ggggctgctg 


ctcagctgcc 


300 


gtgcgtgggg 


gccatcgtcg 


360 


gaactcgctt 


ggccgcaatg 


420 


cgagatcatc 


gggaccctcc 


480 


ccgtgacctt 


ggtggctcag 


540 


cctggctatt 


gactacactg 


600 


gatcacacac 


aacttcagca 


660 


cctggctgta 


ctcatctacg 


720 


cgtgaaggtg 


tggaccagcg 


780 


ctccagggtg 


gagatgaagc 


840 


caggggcagg 


ggcgggcgga 


900 


aagctggcca 


aagtcacttc 


960 


gggtgtttct 


atctctttct 


1020 


aagatttacc 


aattcaccca 


1080 


cctgctagtc 


gcccctcaga 


1140 


aagttgctca 


ccgactcacc 


1200 


tttgggcacg 


gccctccttc 


1260 


gggaagagat 


cccaggaggt 


1320 
1340 



Homo sapiens STRA6 isoform 1 mRNA, complete cds, alternatively spliced. 

/translation="MSSQPAGNQTSPGATEDYSYGSWYIDEPQGGEELQPEGEVPSCHT
SIPPGLYHACLASLSILVLLLLAMLVRRRQLWPDCVRGRPGLPSPVDFLAGDRPRAVPA
AVFMVLLSSLCLLLPDEDALPFLTLASAPSQDGKTEAPRGAWKILGLFYYAALYYPLAA
CATAGHTAAHLLGSTLSWAHLGVQVWQRAECPQVPKIYKYYSLLASLPLLLGLGFLSLW
YPVQLVRSFSRRTGAGSKGLQSSYSEEYLRNLLCRKKLGSSYHTSKHGFLSWARVCLRH
CIYTPQPGFHLPLKLVLSATLTGTAIYQVALLLLVGVVPTIQKVRAGVTTDVSYLLAGF
GIVLSEDKQEVVELVKHHLWALEVCYISALVLSCLLTFLVLMRSLVTHRTNLRALHRGA
ALDLSPLHRSPHPSRQAIFCWMSFSAYQTAFICLGLLVQQIIFFLGTTALAFLVLMPVL
HGRNLLLFRSLESSWPFWLTLALAVILQNMAAHWVFLETHDGHPQLTNRRVLYAATFLL
FPLNVLVGAMVATWRVLLSALYNAIHLGQMDLSLLPPRAATLDPGYYTYRNFLKIEVSQ
SHPAMTAFCSLLLQAQSLLPRTMAAPQDSLRPGEEDEGMQLLQTKDSMAKGARPGASRG
RARWGLAYTLLHNPTLQVFRKTALLGANGAQP"

agtcccagac gggcttttcc cagagagcta aaagagaagg gccagagaat gtcgtcccag 60 

ccagcaggga accagacctc ccccggggcc acagaggact actcctatgg cagctggtac 120 

atcgatgagc cccagggggg cgaggagctc cagccagagg gggaagtgcc ctcctgccac 180 

accagcatac cacccggcct gtaccacgcc tgcctggcct cgctgtcaat ccttgtgctg 240 

ctgctcctgg ccatgctggt gaggcgccgc cagctctggc ctgactgtgt gcgtggcagg 300 

cccggcctgc ccagccctgt ggatttcttg gctggggaca ggccccgggc agtgcctgct 360 

gctgttttca tggtcctcct gagctccctg tgtttgctgc tccccgacga ggacgcattg 420 

cccttcctga ctctcgcctc agcacccagc caagatggga aaactgaggc tccaagaggg 480 

gcctggaaga tactgggact gttctattat gctgccctct actaccctct ggctgcctgt 540 

gccacggctg gccacacagc tgcacacctg ctcggcagca cgctgtcctg ggcccacctt 600 

ggggtccagg tctggcagag ggcagagtgt ccccaggtgc ccaagatcta caagtactac 660 

tccctgctgg cctccctgcc tctcctgctg ggcctcggat tcctgagcct ttggtaccct 720 

gtgcagctgg tgagaagctt cagccgtagg acaggagcag gctccaaggg gctgcagagc 780
agctactctg aggaatatct gaggaacctc ctttgcagga agaagctggg aagcagctac 840 
cacacctcca agcatggctt cctgtcctgg gcccgcgtct gcttgagaca ctgcatctac 900 
actccacagc caggattcca tctcccgctg aagctggtgc tttcagctac actgacaggg 960 

acggccattt accaggtggc cctgctgctg ctggtgggcg tggtacccac tatccagaag 1020 

gtgagggcag gggtcaccac ggatgtctcc tacctgctgg ccggctttgg aatcgtgctc 1080 

tccgaggaca agcaggaggt ggtggagctg gtgaagcacc atctgtgggc tctggaagtg 1140 

tgctacatct cagccttggt cttgtcctgc ttactcacct tcctggtcct gatgcgctca 1200 

ctggtgacac acaggaccaa ccttcgagct ctgcaccgag gagctgccct ggacttgagt 1260 

cccttgcatc ggagtcccca tccctcccgc caagccatat tctgttggat gagcttcagt 1320 

gcctaccaga cagcctttat ctgccttggg ctcctggtgc agcagatcat cttcttcctg 1380

ggaaccacgg ccctggcctt cctggtgctc atgcctgtgc tccatggcag gaacctcctg 1440 

ctcttccgtt ccctggagtc ctcgtggccc ttctggctga ctttggccct ggctgtgatc 1500 

ctgcagaaca tggcagccca ttgggtcttc ctggagactc atgatggaca cccacagctg 1560 

accaaccggc gagtgctcta tgcagccacc tttcttctct tccccctcaa tgtgctggtg 1620 

ggtgccatgg tggccacctg gcgagtgctc ctctctgccc tctacaacgc catccacctt 1680 

ggccagatgg acctcagcct gctgccaccg agagccgcca ctctcgaccc cggctactac 1740 

acgtaccgaa acttcttgaa gattgaagtc agccagtcgc atccagccat gacagccttc 1800 

tgctccctgc tcctgcaagc gcagagcctc ctacccagga ccatggcagc cccccaggac 1860 

agcctcagac caggggagga agacgaaggg atgcagctgc tacagacaaa ggactccatg 1920 

gccaagggag ctaggcccgg ggccagccgc ggcagggctc gctggggtct ggcctacacg 1980 

ctgctgcaca acccaaccct gcaggtcttc cgcaagacgg ccctgttggg tgccaatggt 2040 

gcccagccct gagggcaggg aaggtcaacc cacctgccca tctgtgctga ggcatgttcc 2100 

tgcctaccat cctcctccct ccccggctct cctcccagca tcacaccagc catgcagcca 2160 

gcaggtcctc cggatcactg tggttgggtg gaggtctgtc tgcactggga gcctcaggag 2220 

ggctctgctc cacccacttg gctatgggag agccagcagg ggttctggag aaaaaaactg 2280 

gtgggttagg gccttggtcc aggagccagt tgagccaggg cagccacatc caggcgtctc 2340 

cctaccctgg ctctgccatc agccttgaag ggcctcgatg aagccttctc tggaaccact 2400 

ccagcccagc tccacctcag ccttggcctt cacgctgtgg aagcagccaa ggcacttcct 2460 

caccccctca gcgccacgga cctctctggg gagtggccgg aaagctcccg gtcctctggc 2520



ctgcagggca gcccaagtca tgactcagac caggtcccac actgagctgc ccacactcga 
gagccagata tttttgtagt ttttatgcct ttggctatta tgaaagaggt tagtgtgttc 
cctgcaataa acttgttcct gagaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 




Homo sapiens solute carrier family 7 (cationic amino acid transporter, y+
system), member 7, mRNA (cDNA clone MGC:1534 IMAGE:3504357), complete cds.

/translation="MVDSTEYEVASQPEVETSPLGDGASPGPEQVKLKKEISLLNGVCL
IVGNMIGSGIFVSPKGVLIYSASFGLSLVIWAVGGLFSVFGALCYAELGTTIKKSGASY
AYILEAFGGFLAFIRLWTSLLIIEPTSQAIIAITFANYMVQPLFPSCFAPYAASRLLAA
ACICLLTFINCAYVKWGTLVQDIFTYAKVLALIAVIVAGIVRLGQGASTHFENSFEGSS
FAVGDIALALYSALFSYSGWDTLNYVTEEIKNPERNLPLSIGISMPIVTIIYILTNVAY
YTVLDMRDILASDAVAVTFADQIFGIFNWIIPLSVALSCFGGLNASIVAASRLFFVGSR
EGHLPDAICMIHVERFTPVPSLLFNGIMALIYLCVEDIFQLINYYSFSYWFFVGLSIVG
QLYLRWKEPDRPRPLKLSVFFPIVFCLCTIFLVAVPLYSDTINSLIGIAIALSGLPFYF
LIIRVPEHKRPLYLRRIVGSATRYLQVLCMSVAAEMDLEDGGEMPKQRDPKSN"

gagcatcaga ccacagatcc tggaaggcac ttctctccct gactgctgct cacactgccg 
tgagaacctg cttatatcca ggaccaagga gtgagtggca atgccaggaa gctggtgaag 
ggtttcctct cctccaccat ggttgacagc actgagtatg aagtggcctc ccagcctgag 
gtggaaacct cccctttggg tgatggggcc agcccagggc cggagcaggt gaagctgaag 
aaggagatct cactgcttaa cggcgtgtgc ctgattgtgg ggaacatgat cggctcgggc 
atctttgttt cccccaaggg tgtgctcata tacagtgcct cctttggtct ctctctggtc 
atctgggctg tcgggggcct cttctccgtc tttggggccc tttgttatgc ggaactgggc 
accaccatta agaaatctgg ggccagctat gcctatatcc tggaggcctt tggaggattc 
cttgctttca tcagactctg gacctccctg ctcatcattg agcccaccag ccaggccatc 
attgccatca cctttgccaa ctacatggta cagcctctct tcccgagctg cttcgcccct 
tatgctgcca gccgcctgct ggctgctgcc tgcatttgtc tcttaacctt cattaactgt 
gcctatgtca aatggggaac cctggtacaa gatattttca cctatgctaa agtattggca 
ctgatcgcgg tcatcgttgc aggcattgtt agacttggcc agggagcctc tactcatttt 
gagaattcct ttgagggttc atcatttgca gtgggtgaca ttgccctggc actgtactca 
gctctgttct cctactcagg ctgggacacc ctcaactatg tcactgaaga gatcaagaat 
cctgagagga acctgcccct ctccattggc atctccatgc ccattgtcac catcatctat 
atcttgacca atgtggccta ttatactgtg ctagacatga gagacatctt ggccagtgat 
gctgttgctg tgacttttgc agatcagata tttggaatat ttaactggat aattccactg 
tcagttgcat tatcctgttt tggtggcctc aatgcctcca ttgtggctgc ttctaggctt 
ttctttgtgg gctcaagaga aggccatctc cctgatgcca tctgcatgat ccatgttgag 
cggttcacac cagtgccttc tctgctcttc aatggtatca tggcattgat ctacttgtgc 
gtggaagaca tcttccagct cattaactac tacagcttca gctactggtt ctttgtgggg 
ctttctattg tgggtcagct ttatctgcgc tggaaggagc ctgatcgacc tcgtcccctc 
aagctcagcg ttttcttccc gattgtcttc tgcctctgca ccatcttcct ggtggctgtt 
ccactttaca gtgatactat caactccctc atcggcattg ccattgccct ctcaggcctg 
cccttttact tcctcatcat cagagtgcca gaacataagc gaccgcttta cctccgaagg 
atcgtggggt ctgccacaag gtacctccag gtcctgtgta tgtcagttgc tgcagaaatg 
gatttggaag atggaggaga gatgcccaag caacgggatc ccaagtctaa ctaaacacca 
tctggaatcc tgatgtggaa agcaggggtt tctggtctac tggctagagc taaggaagtt 
gaaaaggaaa gctcacttct ttggaggcac ctgtccagaa gcctggccta ggcagcttca 
acctttgaac ttactttttg aaatgaaaag taatttattt gttttgctac atactgttcc 
agacttttaa aggggacaat gaaggtgact gtggggagga gcatgtcagg tttgggcttg 
gttgttttag aagcacctgg gtgtgcctac ctactcctct tttcttttaa aagggcccac 
aatgctccaa tttcctgtct cctttagaga gacatgaaac tatcacaggt gctggatgac 
aataaaagtt tatgttccta aaaaaaaaaa aaaaaaaaaa 






DE 601440558F1 NIH_MGC_72 Homo sapiens cDNA clone IMAGE:3925214 5' mRNA
DE sequence.



ttctaatctt tctctggggg gaacaggcca cagaactgtg ttagaggtga accatcttaa 60 

ttactagttc tattacctaa ttcagcttcc ttgtttggtc tgctgtggat ctgccttatt 120 

gcatacgcca tgcatcagat aatggatgca tcagataatg gtgttagaca aagcttcatt 180 

gtgaacaacc taatgcattt tagagaaaca atctcatcac attttttcta gcctttccta 240 

catttaaact tgctgttgcc caaattataa ttttttaaat gtctttggtg ggcttctgtt 300 

aattcacatg acttgagctt atagctatgt ctactgcaca gattgggtaa tggaacacta 360 

aacttttata cttgaaaatg acagccttaa atgctcatat cagtcacaaa tctaggatgt 420 

actgtcttgt tgtatgtgag ctttgtagag atttttaaaa atataagcat caccttccca 480 

ttgaagagtg gagagagtct actggatgac tggccaggaa ctttctctct gaatcggaca 540 

tttggatgtc ttctttcttc caagaaatgg tggttcacat taaagtatca tggcttatgt 600 

atgctcaaat ggaatcttat gttactttct tattaaatct ggtctgccta ttttagataa 660 

catgcacggc atgtcttcac caatcacata tagctgagat gcacacatgg ataacgacat 720 

ccacggtcat ttcaaattcc actgccttgg ggatctaaga tatcgggggg caggcccaac 780 

attccgacct cactgtctaa tcagtgagca atgctccaaa acattgatgc ccacggcacc 840 

tccccgtgtc ccaacgaaca accccacaat tccaacgcgg ccccccgatt ctccaccgct 900 
ctgctcactt actaatag 



DE Human DNA for insulin-like growth factor II (IGF-2); exon 7 and additional
ORF. 

/ trans 1 at ion= »»DNFPRYPVGKFFQYDTWKQSTQIUjRRGLPALIiRARRGHVI*AKELE 
AFREAKRHRPLIALPTQDPAHGGAPPEMASNRK" 



gacaacttcc ccagataccc cgtgggcaag ttcttccaat atgacacctg gaagcagtcc 60
acccagcgcc tgcgcagggg cctgcctgcc ctcctgcgtg cccgccgggg tcacgtgctc 120
gccaaggagc tcgaggcgtt cagggaggcc aaacgtcacc gtcccctgat tgctctaccc 180
acccaagacc ccgcccacgg gggcgccccc ccagagatgg ccagcaatcg gaagtgagca 240
aaactgccgc aagtctgcag cccggcgcca ccatcctgca gcctcctcct gaccacggac 300
gtttccatca ggttccatcc cgaaaatctc tcggttccac gtccccctgg ggcttctcct 360
gacccagtcc ccgtgccccg cctccccgaa acaggctact ctcctcggcc ccctccatcg 420
ggctgaggaa gcacagcagc atcttcaaac atgtacaaaa tcgattggct ttaaacaccc 480
ttcacatacc ctccccccaa attatcccca attatcccca cacataaaaa atcaaaacat 540
taaactaacc cccttccccc ccccccacaa caaccctctt aaaactaatt ggctttttag 600
aaacacccca caaaagctca gaaattggct ttaaaaaaaa caaccaccaa aaaaaatcaa 660
ttggctaaaa aaaaaaagta ttaaaaacga attggctgag aaacaattgg caaaataaag 720
gaatttggca ctccccaccc ccctctttct cttctccctt ggactttgag tcaaattggc 780
ctggacttga gtccctgaac cagcaaagag ccccagaaat cacaggtggg aaaagaaggg 840
cacgtcgctc gtaccgccat ctcccttctc tcagggtaaa ctggccatcc acgggaattt 900
gaaaatagca acaacccaga ctggctcctc ccatcactaa aaatcacaga actccctttt 960
gcagtcagag ggacccagta agaccaaagg agagcatgaa aaccaaaatc aggggaggac 1020
catgcaaatg aaatgtaatt ggcacgaccc atcttacatc tcaattccca tcacccccaa 1080
tcctaaaaag cactcatact ttatgcatcc acacacacaa cacacagcac ccgcagctac 1140
acgcatgaac acagcacaca cacgagcaca gagcatacag cacacacaca gcacacacac 1200
aacgcacagc acacacagca cacagatgag acacacaaac gcacagcaca cacacagcac 1260
cacacgcaca cacatgcaca cacagcacac cacacacacg cacacacagt aaacgcacgg 1320
gcacacacag cacacacgca aacgcacacg cgcacagcac acacgcacac cacacacaaa 1380
acagcacaca cacgagcaca cagcacacaa cacgcacaca catgcacaca acgcacagca 1440
cagcacacta gcacacagca cacacacaaa acacatgcac acacagcaca gacacagcac 1500
cacacgcgaa cacagcacac acgaacacag acacacacaa acacagcaca cacacacagc 1560
cacatgcaca cagcacatgc acacacagca cacagcacac agcacacaca cacacatgaa 1620
tgcacacagc acacacgcat gcacagcaca gcacacacaa acacacagca catgaacaca 1680
cacacatgca cacacagcac acacactcat tacatgaaca cagctcacag gcgcagcaca 1740
cacacaaaca cgcagcacac acgttgcaca cacctgcaca cacacatgcg cgcaagcacc 1800
cacacacacg cacaccccca caaaattaga aagcatatct aagcaactac tgaaaacaat 1860
gatatctgta tggatcaggc caaagtcccg ccaatgtttt catggtctga ctaagattct 1920
gcccccctcc tgttcccatc tccactgccc tctgtgccct gcctctcaga ctcggccctg 1980
ggagggggct cagatggtgc ggcctgagtg ggcatttggg atacacccgt tgcggccggc 2040
aggtgggcgg ggtgtgtccc aggcctaatt accatgacag agatgccctt ccatctttcc 2100
gtgaggctgg cctccttggc gcctgtcccc cagcgtgagc cacgatgctc acggcccccg 2160
cccatacccc acccattccc gatacacctt tgttggccca gccagagtga acttactgtg 2220
ggaaggagtt tggccacatt ggagatggcc cagacatgcc cccacgagta ggtagctgag 2280
gcctgactcc ctggtgtgct cctggaagga gaccccccca ccggagcaca agatcttggg 2340
cctagggatc atctttgccc gtctcctggg gaaatgtgga gtcctcgggg gaccccccaa 2400
gccgtgcact gatgcgggga gtgtgggaag ggaggggtgg gtggggggca tctggcggtt 2460
gtgggggctg ggcgggggga gttctggggt cccgggagat tttggatgga aggaagtggt 2520
aaagtcagga ggattgacag cagacttgca gagaaattag gaacccccaa gaattacata 2580
atttcatgtc aattgatcta ttccccctct gggcattttt cctttttttt ttgtttcttg 2640
ttttttttgt ttttttttta cccctcctta ctcagaaacc aaattaaacc gctttatgcg 2700
ccccccccat gtaacagggg ggcagtgaca cgcacgaagc cagcctggag aaagcaagaa 2760
accaccacgt cctgcccccc gccatttatc gattttgttt ttcatctgtc gccctgattg 2820
cctgttgctt gggttgagtt gagggtggag ggcatggcca tgagccccct cctcctgggg 2880
tggagaagtc agaggggagt ggagaaggca ggcttctggg gacagtggct tgtccggcct 2940
ggtccccaga agtcctgagg gcggaggggg gggtctcctc aggtgtcagg gggttgggca 3000
agggtgctcg gaggccacag gagggggctc tgaggctggc cggaggggaa ctggctggcc 3060



ggggctagca ggtgtgtaaa cagagggttc catcagctgg ggcagggtgg ccgccttccg 3120 

cacacttgag gaaccctccc ctctccctcg gtgacatctt gcccgcccct cagcaccctg 3180 

ccttgtctcc aggaggtccg aagctctgtg ggacctcttg ggggcaaggt ggggtgaggc 3240 

cggggagtag ggaggtcagg cgggtctgag cccacagagc aggagagctg ccaggtctgc 3300

ccatcgacca ggttgcttgg gccccggagc ccacgggtct ggtgatgcca tagcagccac 3360 

caccgcggcg cctagggctg cggcagggac tcggcctctg ggaggtttac ctcgccccca 3420 

cttgtgcccc cagctcagcc cccctgcacg cagcccgact agcagtctag aggcctgagg 3480 

cttctgggtc ctggtgacgg ggctggcatg accccggggg tcgtccatgc cagtccgcct 3540 

cagtcgcaga gggtccctcg gcaagcgccc tgtgagtggg ccattcggaa cattggacag 3600 

aagcccaaag agccaaattg tcacaattgt ggaacccaca ttggcctgag atccaaaacg 3660 

cttcgaggca ccccaaatta cctgcccatt cgtcaggaca cccacccacc cagtgttata 3720 

ttctgcctcg ccggagtggg tgttcccggg ctgcctgtct gacctccgtg cctagtcgtg 3780 

gctctccatc ttgtctcctc cccgtgtccc caatgtcttc agtggggggc cccctcttgg 3840 

gtcccctcct ctgccatcac ctgaagaccc ccacgccaaa cactgaatgt cacctgtgcc 3900 

tgccgcctcg gtccaccttg cggcccgtgt ttgactcaac tcagctcctt taacgctaat 3960 

atttccggca aaatcccatg cttgggtttt gtctttaacc ttgtaacgct tgcaatccca 4020 

ataaagcatt aaaagtcatg atcttctgag gtgttccact ctctgacttg ggtactggac 4080 

tgccggaggg agggaagggg ctgagcacct ggaagcaggc agagggggat agaagaggga 4140 

aggggaagga aggcct 4156




nac79g07.x1 NCI_CGAP_Brn23 Homo sapiens cDNA clone IMAGE:3440820 3',
sequence.

gaggtttgtg agtcggtttg ttttcttcat ttcagatatt cgagtcatgg aactttttca 60
aagaactgaa gggggattcc cggtgatttt tccaccccat acagatgaag acatcaaagg 120
gccccgccgc agtgacgaag acaaaagcat gaacagatct gggtccaaag aggctcggca 180
ggtgcggtgg gcacgaggga gcgacccccg ggtggccgag ggactgaggg atgcgcgtcc 240
agccccggag ggcggcgtcc acctgacccc ccagcccgag ccgccgccgc cgcctcccag 300
cccggcctgc gctgccccct ggcggtgggg ccaggagcca tcccagcctc taagcccccg 360
accctacctc ggcccccggg gctgccgcgg aagggccccc aaagcctgct catagccaag 420
ggacaggtat gtggccaagg ccccccacag ccctgaactg gagtgtgtct gaggctccgg 480
caggggcccc ctcacttggg cgcggagccc tgggagtgga ga 522



/translation="MRLPRRAALGIJI1PLI1LLLPPAPEAAKKPTPCHRCRGLVDKFNQGM
TOTAKKNFGGGNTAWEEK^^ 

^QLKSEYPDLPEWFCnna-IJCVCCSPGTyGPDCIACQGGSQRPCSGNGHCSGDGSRQCT 
GSCRCHMGYQGPLCTDCMDGYFSSIiRNETHSICT^ 

IJDEGACVDVDEC^^PPPCSAAQFCKNANGSYTCEECDSSCVGCTGEGPGNCKECISGY 
? J A^HGQCADVDECALAEKTCVRK^ 






gctccggctg 


cgtcttcccg 


60 


ggggctcctg 


ccgcttctgc 


120 


gccctgccac 


cggtgccggg 


180 


aaagaagaac 


tttggcggcg 


240 


gtccagcgag 


attcgcctgc 


300 


atgcaaccag 


atgctagagg 


360 


gagcgaatat 


cctgacttat 


420 


tccaggaacc 


tacggtcccg 


480 


cgggaatggc 


cactgcagcg 


540 


catggggtac 


cagggcccgc 


600 


gaacgagacc 


cacagcatct 


660 


gaccaacaga 


gactgcggcg 


720 


ggatgtggac 


gagtgtgcgg 


780 


cgccaacggc 


tcctacacgt 


840 


aggcccagga 


aactgtaaag 


900 


agatgtggac 


gagtgcgcac 


960 


caatactcca 


gggagctacg 


1020 


ctgtgtgccg 


ccggcagagg 


1080 


ccgcgaagac 


ctgtaatgtg 


1140 


gaaaatgtgg 


ccctgaggat 


1200 


gctctctaac 


ggttgattct 


1260 


aacagacttg 


tatattttga 


1320 


aaaaaaaaaa 


aaaaaaaaa 


1379 



Homo sapiens cDNA clone IMAGE:3952627, partial cds.

caaaatatct gcatccacct ggagatgcag ctaagtgggt ccttatgtac acaccacgtt 60
cacacacaca cagagggacc acgtgtgcac gcatgaccgt gtgggtggcg gcgtttgctg 120
tgaaccatgc tcaggccaca cagagacaca tacttggttt ctgggactga gacccaggcc 180
tggcaggacc gtgcctacag atactgcaaa cgttcctaca gcctagaggt gcgtatacac 240
acccaagtac acgcagccag gcattcaggg gtgtgtttgc cacatggagc atcccttcct 300
ggtcttgcca ggcacctgca cagagcgtct ccagccccat ctcctaacgg gggctggggg 360
taagagaaat ctaactgcgc tcccccaacc cctcgccctg ccatcttccc ctcaagcctg 420
ctaagttatc ccaggcctgt gcgtggtgga aaaagccagc cttggccctg cagcctccac 480
ctcgccgctg ggggaccaac aggttgctta cagctttgca ccccggcatc agcacagggg 540
tccctgcccc accctccggc agctcaggga gtgttttcct gtgaggcctc ccccatcagt 600
ggaccagagg gagaagcccg atgccccatc ccggctttcc cgtaacgcac aggacacgtg 660
tgcaattcat aggaacggcc cagatcgccc tcatgagtgc cacctggtac aggtaggtgg 720
cgctcacgtt cctgcccaaa tgcagcccat cggggagtca cagtcagtcc ccccggcccc 780
cctcccagtc cctgttggct ttcggtagct ctcgcatgca gttctattaa cagccgtcta 840
gaagcgatgc tttagtggcc taacccaggg tcaaatacag ctctttctag caaaatcagg 900
cagctctgcc ccatcggtag gggcaccgat tagtctacta acagccagag gtccatctag 960
gagggtgccg ggaggagctg agcccccgga ggtgggctcc tggtgacggg tgtccaagaa 1020
gcggtttcct tgggagcttc tgcctccgtg ggcctctcag cccgccccgt gtggccgccc 1080
gggtgtggct cagccatgtc ccctccccag gtccttcatt cacccctccc ctccccacag 1140
tggaattgtt gaagtgtggc gagtctgtgc tcgggacaat aaagcttgtg acaggtccag 1200
gaccccggca aaaaaaaaaa aa 1222



DE Homo sapiens cDNA clone IMAGE:3952627, partial cds.

caaaatatct gcatccacct ggagatgcag ctaagtgggt ccttatgtac acaccacgtt 60
cacacacaca cagagggacc acgtgtgcac gcatgaccgt gtgggtggcg gcgtttgctg 120
tgaaccatgc tcaggccaca cagagacaca tacttggttt ctgggactga gacccaggcc 180
tggcaggacc gtgcctacag atactgcaaa cgttcctaca gcctagaggt gcgtatacac 240
acccaagtac acgcagccag gcattcaggg gtgtgtttgc cacatggagc atcccttcct 300
ggtcttgcca ggcacctgca cagagcgtct ccagccccat ctcctaacgg gggctggggg 360
taagagaaat ctaactgcgc tcccccaacc cctcgccctg ccatcttccc ctcaagcctg 420
ctaagttatc ccaggcctgt gcgtggtgga aaaagccagc cttggccctg cagcctccac 480
ctcgccgctg ggggaccaac aggttgctta cagctttgca ccccggcatc agcacagggg 540
tccctgcccc accctccggc agctcaggga gtgttttcct gtgaggcctc ccccatcagt 600
ggaccagagg gagaagcccg atgccccatc ccggctttcc cgtaacgcac aggacacgtg 660
tgcaattcat aggaacggcc cagatcgccc tcatgagtgc cacctggtac aggtaggtgg 720
cgctcacgtt cctgcccaaa tgcagcccat cggggagtca cagtcagtcc ccccggcccc 780
cctcccagtc cctgttggct ttcggtagct ctcgcatgca gttctattaa cagccgtcta 840
gaagcgatgc tttagtggcc taacccaggg tcaaatacag ctctttctag caaaatcagg 900
cagctctgcc ccatcggtag gggcaccgat tagtctacta acagccagag gtccatctag 960
gagggtgccg ggaggagctg agcccccgga ggtgggctcc tggtgacggg tgtccaagaa 1020
gcggtttcct tgggagcttc tgcctccgtg ggcctctcag cccgccccgt gtggccgccc 1080
gggtgtggct cagccatgtc ccctccccag gtccttcatt cacccctccc ctccccacag 1140
tggaattgtt gaagtgtggc gagtctgtgc tcgggacaat aaagcttgtg acaggtccag 1200
gaccccggca aaaaaaaaaa aa 1222



PT1.1_07_C06.r tumor1 Homo sapiens cDNA 5', mRNA sequence.

cngggcntgc aggaattctg gnacgagtct gggtccntgg tttctctcca tactcccttc 60
cttaggctcc tgaactcgtt tgctcctaaa tcttgttaat tctttttctc tggattttgg 120
tttcttttgg ctttcccttg ccttcccctt tctctgtctc caacactctt tccccatgtc 180
tttctggctg tctctatgtt cctcttctct tatcctnaac tttctgtcca ttcgggcctc 240
ctcccnacct cccacgcccc agcccctccc tccttggtct ccttttcgat atgccaaacc 300
aattttgggt cgagtgcatt taacgagaac anaacaaaag gctcataaca acaagaacgt 360
ttcagaaaaa aacaaaaagg gtttaaaaaa attgttgagg tcaaaaaagg caaancanta 420
anggaantta ngntttcctt gggaaaaaat nnantntaaa aaaanactng gngggggggc 480
ccgggtaccc naaattttgg cccnnatnag gtgagccggg nttnncaatt caacttggcc 540
ggncgntttt acaaacgnnn ggagccttgg gnaaanccct nngggggtan cccanccttn 600
ntncgncatt tnaaggaaaa nttccctntt tnggccagga ttggggaaat tng 653



60 
120 
180 
240 



480 
540 
600 



Homo sapiens cDNA FLJ12940 fis, clone NT2RP2005038, weakly similar to DNA
NUCLEOTIDYLEXOTRANSFERASE (EC 2.7.7.31).

/translation="MLPKRRRARVGSPSGDAASSTPPSTRFPGVAIYLVEPRMGRSRRA

FLTGLARSKGFRVLDACSSEATHVVMEETSAEEAVSWQERRMAAAPPGCTPPALLDISW

LTESLGAGQPVPVECRHRLEVAGPRKGPLSPAWMPAYACQRPTPLTHHNTGLSEALEIL

AEAAGFEGSEGRLLTFCRAASVLKALPSPVTTLSQLQGLPHFGEHSSRVVQELLEHGVC
EEVERVRRSESSSPRSSGSV"

actcactggg gcttccttcc gtctcgctcg gagtttccct ctgcgttcgc tccgcgctgc 
tggaggctgt cgtcccaatg ctccccaaac ggcggcgagc gcgggtcggg tcccctagcg 
gcgatgccgc ttcctccacg ccgccctcga cgcgcttccc gggagtcgcc atctacctgg 
tcgagcctcg catgggtcgc agccgccggg ccttcctcac aggcctggcg cgctccaaag 

gcttccgcgt ccttgacgcc tgcagctccg aagcgacaca tgttgtgatg gaagagacct 300 

cagcagagga ggccgtcagc tggcaggagc gcaggatggc agctgctccc ccgggttgca 360 

t f9 ct 99 ac ataagctggt taacagagag cctgggagct gggcagcctg 420 
tacctgtgga gtgccggcac cgcctggagg tggctgggcc aaggaagggg cctctgagcc 
cagcatggat gcctgcctat gcctgccagc gccctacgcc cctcacacac cacaacactg 
gcctctccga ggctctggag atactggccg aggcagcagg ctttgaaggc agtgagggcc 

gcctcctcac cttctgcaga gcagcctcgg tgctcaaggc ccttcccagc cctgtcacaa 660 

ccctgagcca gctgcagggg cttccccact ttggagaaca ctcctctagg gttgtccagg 720 

agctgctgga gcatggagtg tgtgaggagg tggagagagt tcggcgctca gagagctctt 780 

cacccagatc ttcggggtcg gtgtgaagac tgctgaccgg tggtaccggg aaggactgcg 840 

aaccttagat gacctccgag agcagcccca gaaactaacc caacagcaga aagcggggct 900 

ccagcaccac caggacctga gcaccccagt cctgcggtcc gatgtagatg ccctgcagca 960 

ggtggtggag gaagctgtgg ggcaggccct gcctggggcc accgtcacgc tgaccggcgg 1020 

cttccgcagg gccatgacgt ggacttcctc atcacccacc ccaaggaggg tcaggaggcg 1080 

gggctgctgc ctagagtgat gtgccgcctg caggaccagg gcctcatcct gtaccaccag 1140

ttlltf. CaCa 9 ^9 c ^ fc 9 a 9tcccctacc cgcctggccc aacagagcca catggacgct 1200 

tttgagagaa gtttctgcat tttccgccta ccacaacctc caggggctgc tgtgggggga 1260 

111^*° C tlf CCCatC ct 93aaggcc gtgagagtgg acttggtagV tg^alccgc llto 

agccagttcc ctttcgccct gctcggttgg actggctcca agcttttcca gcgggagctg 1380 

cgccgcttca gccggaagga gaagggcctg tggctgaaca gccatgggct gtttgacccg 1440
gagcagaaga catttttcca agcggcttca gaggaagaca tcttcagaca cctgggcctt 
gagtaccttc ctccagagca gagaaacgcc tgagcctgcc tgtgtcccca acttccactc 
aggaaattgg gctgccccca acctggccac tgaatgtctc caggcagata tgctgccccc 

S a oo C r aC <* tcacccct ccccgccaag gcctggctct tccggaggtc aattgtgcct 1680 

foSSoo 39 " 9ag "= Ct 9 ct 93t3tgc tgcagggtgt gatgaggtgg gagccctcag 1740 

tgccagcctc atcactgtgt gaccctgggt ctgctcttag cctccccatg gctcacgttc 1800 
ctgccctgga tgggatgtga gcggggccca catcgtggag ctgtggtggg gcctgcagtc 
atgaatggca agtggtccct gatgtgcagt gtcccattag ttgcactgca gttaactgtg 
gctcctgcag ggcaccctgc ccagaatgcc cagaagagaa ccatgcatac ctgcactgca 
tttgagagcc atgagctgga ggctgtggtt cgtgccagca aggagcctac tgtctggtgt 

gctgtaggca tctggagagg gagagggcct gggtaggagc tgggaggaag ataattttca 2100 

actatggggc ttcagtactg cagcgccccg agccaggctc tgtgcttctg cctttaaggc 2160 

ttftt, :tCa ? cacaat 9 tct caaaaatagg tcatatcctg ccactcccgt cgcagagccc 2220 

tttaatggtt ccaaacccta agtccacaca tagcccctgg ctctggcatc tctccagccc 2280 

cactggcccc gagctgcttg actcaccggc ttcctatttg atgcacccag gcccccttgt 2340 

ggccaactcc ctccccttct cactgaggca gaagcactga ggtgggctgg acatgggtgc 2400 

cctccacgtc ccccatatcc ccaggcacac tctggcctca ggttttgccc tggccSgL 2460 

atctacctgg agtgggccct ccccttcttc aggccttgaa tcaaaagcca ctttgttagg 2520 
cgaggatttc ccagaccact catcacatta aaaaatattt tg 



1500 
1560 
1620 



1860 
1920 
1980 
2040 



2562 



np60h03.s1 NCI_CGAP_Br2 Homo sapiens cDNA clone IMAGE:1130741 3', mRNA
sequence.

atggtgttcc ctgagcggtt gctgcgggtg atggatactc ttctgatact ggctcttcgt 60
gctataattt cttttctcac caagagcagg tgccctttca gaagggaatg ggagtggagg 120
gagggtcaca gaaacacctc ggcactgggg gaaacgtggc ctagcctctg gcgacggcga 180
gcagcggccg gaagcgacgg gggctgcggg ccggcgcggg ttcagaggct tctttttccg 240
cggacggaga cactgtacag cacaacctcg ggaaaacgcc aacgccgacg ccttctccaa 300
caaaagatgg cctcggactc aagagtgcgg ctccagggca atgcagcccc aacctaaaga 360
tttagaggcc tcccgtttcg ctggccccca gagccgccca ccgggactgc acttccccac 420
cgataaaagg tggtttccag ggtacctccc tcagatggcg gcggcggctc ccgacggctt 480
actcaccagc atccttcgcg ggcgggggct ctcggcaagg cggcctcgtg ccgaatcc 538



2 ™TL S ^ iG ? S f LL1 - fused gene from chromosome 1q, mRNA (cDNA clone
DE IMAGE:2823316).

ggaagctatg agggaccctg tgagtagcca gtacagttcc tttcttttct ggaggatgcc 60 

catcccagaa ctggatctgt cggagctgga aggcctgggt ctgtcagata cagccaccta 120

caaggtcaaa gacagcagcg ttggcaaaat gatcgggcaa gcaactgcag cagaccagga 180 

gaaaaaccct gaaggtgatg gcctccttga gtacagcacc ttcaacttct ggagagctcc 240 

cattgccagc atccactcct tcgaactgga cttgctctaa ggccaagact tctctctccc 300 

atcaccttgc cctcattgtc ttccctctca agccccttcc tttccactcc tttcccattt 360 

taatcttgtt ctctccctac tgtgttggtg gtgctgatga atctgccaga gttgagttct 420 

atgtatttat ttatctatct gtctactcca tttctctcaa aagccctcaa gtcacaaagt 480 

aaatggttca agcaatggag tactgggtca cagggattcc tcctttcccc cccaaatatt 540 

aactccagaa actaggcctg actggggaca cctgagagta gtatagtagt gcaaaatgga 600 

agactgattt ttgactctat tataatcagc ttcagagatt ccttaaacct tcctaatttc 660 

ctgctccagg gcagtaaaca caaatatttc ttcaaggggt gatgaaaacc tcggaagttt 720 

taatttgagg ttatctgcta cgaaacagta tttctaaaag gctaaagtga taagtctctt 780 

gctttttttt gatcctgctc ttatattctt ttttttcctc agagaaatca ggagggtagt 840 

tagaggtata aaacaggagg aaatattatg gaaaatgaaa atagggaaaa taattgaatc 900

attttagaag tagctaattt cttttctcaa aagagtgtcc cttcttcaca cctactcact 960 

ttacaacttt gctcctaact gtgggttgaa aactctagct aaagaaagtt atcaaatctt 1020 

aacatgcatt cctactatta tgatagtttt taaggtttca attcaatctt ctgaacggca 1080 

taagtcctat tttagcctta cctcctgcat ttgcaatacg taatactgat cagtgggcac 1140 

agttcttcag ctacattgag accctgaaat gaacaattat actctgactc gacatcttgt 1200 

ccccaatcct tccaaaaata ttgatggtga tttgtgctac catttactcg tttatttaat 1260 

aaagacattc aatcccagga aaaaaaaaaa aaaaaaaaaa aa 1302



Human mRNA for acetyl-coenzyme A transporter, complete cds.

acetyl-coenzyme A transporter.

/translation="MSPTISHKDSSRQRRPGNFSHSLDMKSGPLPPGGWDDSHLDSAGR
EGDREALLGDTGTGDFLKAPQSFRAELSSILLLLFLYVLQGIPLGLAGSIPLILQSKNV

SYTDQAFFSFVFWPFSLKLLWAPLVDAVYVKNFGR

RLLGNTDDRTPDVIALTVAFFLFEFLAATQDIAVDGWALTMLSRE

AGYFLGNVLFLALESADFCNKYLRFQPQPRGIVTLSDFL

NEVSVVKEETQGITDTYKLLFAIIKMPAVLTFCLLILTAKIGFSAADAVTGLKLVEEGV
PKEHLALLAVPMVPLQIILPLIISKYTAGPQPLNTFYKAMPYRLLLGLEYALLVWWTPK
VEHQGGFPIYYYIVVLLSYALHQVTVYSMYVSIMAFNAKVSDPLIGGTYMTLLNTVSNL
GGNWPSTVALWLVDPLTVKECVGASNQNCRTPDAVELCKKL
VFIGFGWWFFLGPKFKKLQDEGSSSWKCKRNN"

gaattcgcag cgagagctgg aggtgttggg tcgggagacc agccattcga tcccgccgca 60 
ggtaggagct ggtttccatc ctggcaccac ggcacacacc tccagcctcg agcccggcgc 120 
tgctgcccgg gggtctcctt caggctcttt gacgccgttc cagggggcac ctatccaggc 180 
atcctctggg cctctagcca gaggactggc tcccggcttc agcactccgg gctgcagtaa 240
gaagtgccct tatcgctctg agccctgcca ccatcccgtg aaccaccgaa accctggtcc 
agcgcgacag ccttggacct gggactggac ggatccaaaa cgctcagcct cggcccccca 
cagacggggc tctgcatcgt ctctgatatg tcacccacca tctcccacaa ggacagcagc 
cggcaacggc ggccagggaa tttcagtcac tctctggata tgaagagcgg tcccctgccg 
ccaggcggtt gggatgacag tcatttggac tcagcgggcc gggaagggga cagagaagct 
cttctggggg ataccggcac tggcgacttc ttaaaagccc cacagagctt ccgggccgaa 
ctaagcagca ttttgctact actctttctt tacgtgcttc agggtattcc cctgggcttg 
gcgggaagca tcccactcat tttgcaaagc aaaaatgtta gctatacaga ccaagctttc 
ttcagttttg tcttttggcc cttcagtctc aaattactct gggccccgtt ggttgatgcg 
gtctacgtta agaacttcgg tcgtcgcaaa tcttggcttg tcccgacaca gtatatacta 
ggactcttca tgatctattt atccactcag gtggaccgtt tgcttgggaa taccgatgac 
agaacacccg acgtgattgc tctcactgtg gcgttctttt tgtttgaatt cttggccgcc 
actcaggaca ttgccgtcga tggttgggcg ttaactatgt tatccaggga aaatgtgggt 
tatgcttcta cttgcaattc ggtgggccaa acagcgggtt actttttggg caatgttttg 1080 
tttttggccc ttgaatctgc cgacttttgt aacaaatatt tgcggtttca gcctcaaccc 1140 
agaggaatcg ttactctttc agatttcctt tttttctggg gaactgtatt tttaataaca 1200 
acaacattgg ttgcccttct gaaaaaagaa aacgaagtat cagtagtaaa agaagaaaca 1260 
caagggatca cagatactta caagctgctt tttgcaatta taaaaatgcc agcagttctg 1320 
acattttgcc ttctgattct aactgcaaag attggttttt cagcagcaga tgctgtaaca 1380 
ggactgaaat tggtagaaga gggagtaccc aaagaacatt tagccttatt ggcagttcca 1440 
atggttcctt tgcagataat actgcctctg attatcagca aatacactgc aggtccccag 1500 
ccattaaaca cattttacaa agccatgccc tacagattat tgcttgggtt agaatatgcc 1560 
ctactggttt ggtggactcc taaagtagaa catcaagggg gattccctat atattactat 1620 
atcgtagtcc tgctgagtta tgctttacat caggttacag tgtacagcat gtatgtttct 1680 
ataatggctt tcaatgcaaa ggttagtgat ccacttattg gaggaacata catgaccctt 1740 
ttaaataccg tgtccaatct gggaggaaac tggccttcta cagtagctct ttggcttgta 1800 
gatcccctca cagtaaaaga gtgtgtagga gcatcaaacc agaattgtcg aacacctgat 1860
gctgttgagc tttgcaaaaa actgggtggc tcatgtgtta cagccctgga tggttattat 1920 
gtggagtcca ttatttgtgt tttcattgga tttggttggt ggttctttct tggtccaaaa 1980 
tttaaaaagt tacaggatga aggatcatct tcgtggaaat gcaaaaggaa caattaatat 2040 
atatgctact ggacattcta gcaaggtaat tgtagtttag ttttaattcg gagagcaatg 2100 
ataatcagtg cacaggagta taaaatatta ttttaaacag cgaaattaat aatataaaat 2160 
gccaaatggt tgaaaaaata gaaacctttc tgtatatttg atcatatttt ttttttgcct 2220 
tgtcaatgta tttaaagttt acttaaggtc aggaaattct aaaacaactt ttctggcctt 2280 
gttatttgat gtatatcttt taaatttact gaccaaagca tgttttaagc tgcaatgcag 
tagtcacggg tggtaaccat gtagtcaggt attgttatta gtacctatca ctgctgagct 
gtat£taaaa ttttggtaca atatataaaa tggagaagag cttgatattc aggtactaac 
cacaactagt ctgacattgt tggcagttaa aatcttattt tgaattgtaa attagttaaa 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



2340 
2400 
2460 
2520 



ttttatgtgg aatttgctga gaaaagaata tagactactg aaatgtcatt ttagttattt 
ttcttatgac cacattgtac aaatgaatct gtgttaaaaa gactatttta aatgtatttc 
ctgcttttgt aagcattaaa gatttgaatt ccaccacact gg 




Homo sapiens SDF2L1 mRNA for SDF2 like protein 1, complete cds. 

/translation="MWSAGRGGAAWPVLLGLLLALLVPGGGA

NTHHRVRLHSHDIKYGSGSGQQSVTGVEASDDANSYWRIRGGSEGGCPCGSPVRCGQAV

RLTHVLTGKNLHTHHFPSPLSNNQEVSAFGEDGEGDDLDLWTVRCSGQHWEREAAVRLQ

HVGTSVFLSVTGEQYGSPIRGQHEVHGMPSANTHNTWKAMEGIFIKPSVEPSAGHDEL"

gctggagccg ggccggggcg atgtggagcg cgggccgcgg cggggctgcc tggccggtgc 
tgttggggct gctgctggcg ctgttagtgc cgggcggtgg tgccgccaag accggtgcgg 
agctcgtgac ctgcgggtcg gtgctgaagc tgctcaatac gcaccaccgc gtgcggctgc 
actcgcacga catcaaatac ggatccggca gcggccagca atcggtgacc ggcgtagagg 
cgtcggacga cgcgaatagc tactggcgga tccgcggcgg ctcggagggc gggtgcccgt 
gcgggtcccc ggtgcgctgc gggcaggcgg tgaggctcac gcatgtgctt acgggcaaga 
acctgcacac gcaccacttc ccgtcgccgc tgtccaacaa ccaggaggtg agtgcctttg 
gggaagacgg cgagggcgac gacctggacc tatggacagt gcgctgctct ggacagcact 
gggagcgtga ggctgctgtg cgcttacagc atgtgggcac ctctgtgttc ctgtcagtca 
cgggtgagca gtatggaagc cccatccgtg ggcagcatga ggtccacggc atgcccagtg 
ccaacacgca caatacgtgg aaggccatgg aaggcatctt catcaagcct agtgtggagc 
cctctgcagg tcacgatgaa ctctgagtgt gtggatggat gggtggatgg agggtggcag 
gtggggcgtc tgcagggcca ctcttggcag agactttggg tttgtagggg tcctcaagtg 
cctttgtgat taaagaatgt tggtctatga
120
180

DE Homo sapiens RTN2-A (RTN2) mRNA, complete cds.






FT /translation="MGQVLPVFAHCKEAPSTASSTPDSTEGGNDDSDFRELHTAREFSE

FT EDEEETTSQDWGTPRELTFSYIAFDGVVGSGGRRDSTARRPRPQGRSVSEPRDQHPQPS

LGDSLESIPSLSQSPEPGRRGDPDTAPPSERPLEDLRLRLDHLGWVARGTGSGEDSSTS
SSTPLEDEEPQEPNRLETGEAGEELDLRLRLAQPSSPEVLTPQLSPGSGTPQAGTPSPS
RSRDSNSGPEEPLLEEEEKQWGPLEREPVRGQCLDSTDQLEFTVEPRLLGTAMEWLKTS
LLLAVYKTVPILELSPPLWTAIGWVQRGPTPPTPVLRVLLKWAKSPRSSGVPSLSLGAD
MGSKVADLLYWKDTRTSGVVFTGLMVSLLCLLHFSIVSVAAHLALLLLCGTISLRVYRK
VLQAVHRGDGANPFQAYLDVDLTLTREQTERLSHQITSRVVSAATQLRHFFLVEDLVDS
LKLALLFYILTFVGAIFNGLTLLILGVIGLFTIPLLYRQHQAQIDQYVGLVTNQLSHIK
AKIRAKIPGTGALASAAAAVSGSKAKAE"

cccgggagga ggaggcggcg agaatggcag cggcgtcgtg ggcgcggcgg agatgagcgc 60 

ccgcgacccc gggcccaggg cggcacagcc ggagtgggcg ggggtcccga tgcaggcccg 120

aggggggcca tggggcaggt cctgccggtc ttcgcccact gcaaagaagc tccgtctaca 180

gcctcctcaa ctcctgattc cacagaagga gggaacgacg actctgattt tcgagagctg 240 

cacacagccc gggaattctc agaggaggac gaggaggaga ccacgtcgca ggactggggc 300 

accccccggg agctgacctt ctcctacatc gcctttgatg gtgtagtggg ctccgggggc 360 

cgcagggatt caactgcccg ccgcccccgc ccccagggcc gctcagtctc ggaaccacga 420 

gaccagcacc ctcagcccag cctgggcgac agcttggaga gcatccccag cctgagccaa 480 

tccccggagc ctggacgacg gggtgatcct gacaccgcgc ctccatccga gcgccctctg 540 

gaagacctga ggcttcggtt ggaccatctg ggctgggtgg cccggggaac gggatccggg 600 

gaggactctt ccaccagcag ctccaccccg ctggaagacg aagaacccca agaacccaac 660 

agattggaga caggagaagc tggggaagaa ctggacctac gactccgact tgctcagccc 720

tcatcgcccg aggtcttgac tccccagctc agtccgggct ctgggacacc ccaggccggt 780 

actccgtccc catcccgatc gcgagattcg aactctgggc ccgaagagcc attgctggaa 840 

gaggaagaaa agcagtgggg gccactggag cgagagccag taaggggaca gtgcctcgat 900 

agcacggacc aattagaatt cacggtggag ccacgccttc taggaacagc tatggaatgg 960 

ttaaagacat cattgctttt ggctgtttac aagacggttc caattttgga attgtcccca 1020 

cctctgtgga cagccattgg ctgggtccaa aggggcccca ccccccctac tcctgtcctc 1080 

cgggttctac tgaagtgggc aaaatccccg agaagcagcg gtgtccccag cctctcactc 1140

ggagccgata tggggagtaa agtggcggac ctgctgtact ggaaggacac gaggacgtca 1200

ggagtggtct tcacaggcct gatggtctcc ctcctctgcc tcctgcactt tagcatcgtg 1260 

tccgtggccg cgcacttggc tctgttgctg ctctgcggca ccatctctct cagggtttac 1320 

cgcaaagtgc tgcaggccgt gcaccggggg gatggagcca accctttcca ggcctacctg 1380 

gatgtggacc tcaccctgac tcgggagcag acggaacgtt tgtcccacca gatcacctcc 1440 

cgcgtggtct cggcggccac gcagctgcgg cacttcttcc tggtagaaga cctcgtggat 1500 

tccctcaagc tggccctcct cttctacatc ttgaccttcg tgggtgccat cttcaatggt 1560 

ttgactcttc tcattctggg agtgattggt ctattcacca tccccctgct gtaccggcag 1620 

caccaggctc agatcgacca atatgtgggg ttggtgacca atcagttgag ccacatcaaa 1680 

gctaagatcc gagctaaaat cccagggacc ggagccctgg cctctgcagc agccgcagtc 1740 

tccggatcca aagccaaagc cgaatgagaa cggtgtctct gcccgcagga cgcctgcccc 1800 

cagcccccgc agccctctgg ccccctccat ctcttgtccg ttcccaccca cccccctcct 1860

cggcccgagc cttttcccgg tgggtgtcag gatcactccc actagggact ctgcgctaat 1920 

tacctgagcg accaggacta catttcccaa gaggctctgc tccaggagtc caggaaagac 1980 

gaggcacctt ggccgcgggg cctgctggga cttgtagttg cctagacagg gcaccaccct 2040 

gcacttccgg acccgccgct ggaggcgccg tgaggcgttg gtgtctcctg gatgctacta 2100

gccccaacgc cggggctttg catggggccc aggggaggcc tgagcttgga tttacactgt 2160 
aataaagact cctgtggaaa aaaaaaaaaa 



2190 



Homo sapiens cDNA: FLJ22209 fis, clone HRC01496. 

cgatgatgag gctgaagaaa aggaagacaa agaagaagaa aaagaaaaag aagagaaaga 
gtcggaagac aaacctgaaa ttgaagatgt tggttctgat gaagaagaag aaaagaaacc 
aaagactaaa aaagttgaaa aaactgtctg ggactgggaa cttatgaatg atatcgttca 
taaactttcc tatttatgta tggagcagca agactgaaac tgttgaggag cccatggagg 
aagaagaagc agccaaagaa gagaaagaag aatctgatga tgaagctgca gtagaggaag 
aagaagaaga aaagaaacca aagactaaaa aagttgaaaa aactgtctgg gactgggaac 
ttatgaatga tatcgttcat aaactttcct atttatgtat ggagcagcaa gactgaaact 
gttgaggagc ccatggagga agaagaagca gccaaagaag agaaagaaga atctgatgat 
gaagctgcag tagaggaaga agaagaagaa aagaaaccaa agactaaaaa agttgaaaaa 
actgtctggg actgggaact tatgaatgat atcgttcata aactttccta tttatgtatg 
gagcagcaag actgaaactg ttgaggagcc catggaggaa gaagaagcag ccaaagaaga 
gaaagaagaa tctgatgatg aagctgcagt agaggaagaa gaagaagaaa agaaaccaaa 
gactaaaaaa gttgaaaaaa ctgtctggga ctgggaactt atgaatgata tcgttcataa 
actttcctat ttatgtatgg agcagcaaga ctgaaactgt tgaggagccc atggaggaag 
aagaagcagc caaagaagag aaagaagaat ctgatgatga agctgcagta gaggaagaag 
aagaagaaaa gaaaccaaag actaaaaaag ttgaaaaaac tgtctgggac tgggaactta 
tgaatgatat cgttcataaa ctttcctatt tatgtatgga gcagcaagac tgaaactgtt 
gaggagccca tggaggaaga agaagcagcc aaagaagaga aagaagaatc tgatgatgaa 
gctgcagtag aggaagaaga agaagaaaag aaaccaaaga ctaaaaaagt tgaaaaaact 1140 
gtctgggact gggaacttat gaatgatatc gttcataaac tttcctattt atgtatggag 1200
cggcaagact gaaactgttg aggagcccat ggaggaagaa gaagcagcca aagaagagaa 1260 
agaagaatct gatgatgaag ctgcagtaga ggaagaagaa gaagaaaaga aaccaaagac 1320 
taaaaaagtt gaaaaaactg tctgggactg ggaacttatg aatgatatcg ttcataaact 1380 
ttcctattta tgtatggagc agcaagactg aaactgttga ggagcccatg gaggaagaag 1440 
aagcagccaa agaagagaaa gaagaatctg atgatgaagc tgcagtagag gaagaagaag 
aagaaaagaa accaaagact aaaaaagttg aaaaaactgt ctgggactgg gaacttatga 
atgatatcgt tcataaactt tcctatttat gtatggagca gcaagactga aactgttgag 
gagcccatgg aggaagaaga agcagccaaa gaagagaaag aagaatctga tgatgaagct 
gcagtagagg aagaagaaga agaaaagaaa ccaaagacta aaaaagttga aaaaactgtc 
tgggactggg aacttatgaa tgatatcgtt cataaacttt cctatttatg tatggagcag 
caagactgaa actgttgagg agcccatgga ggaagaagaa gcagccaaag aagagaaaga 
agaatctgat gatgaagctg cagtagagga agaagaagaa gaaaagaaac caaagactaa 
aaaagttgaa aaaactgtct gggactggga acttatgaat gatatcgttc ataaactttc 
ctatttatgt atggagcagc aagactgaaa ctgttgagga gcccatggag gaagaagaag 
cagccaaaga agagaaagaa gaatctgatg atgaagctgc agtagaggaa aaaaaaaaaa
1500
1560
1680
1740

Homo sapiens UDP-N-acetylglucosamine-2-epimerase mRNA, complete cds.

/translation="MEKNGNNRKLRVCVATCNRADYSKLAPIMFGIKTEPEFFELDVVV
LGSHLIDDYGNTYRMIEQDDFDINTRLHTIVRGEDEAAMVESVGLALVKLPDVLNRLKP
DIMIVHGDRFDALALATSAALMNIRILHIEGGEVSGTIDDSIRHAITKLAHYHVCCTRS
AEQHLISMCEDHDRILLAGCPSYDKLLSAKNKDYMSIIRMWLGDDVKSKDYIVALQHPV

TTDIKHSIKMFELTLDALISFNKRTLVLFPNIDAGSKEMVRVMRKKGIEHHPNFRAVKH
VPFDQFIQLVAHAGCMIGNSSCGVREVGAFGTPVINLGTRQIGRETGENVLHVRDADTQ
DKILQALHLQFGKQYPCSKIYGDGNAVPRILKFLKSIDLQEPLQKKFCFPPVKENISQD
^?^^^^^^^^^^ i ^^'^^^^'^ I ^*^ V ' ? ^^^^^ < ^^ ^ VKKYXQBTsTPKT YEER INXi I HjQMCTVEAAA. 
EAVKI^CRILGVGISTGGRVNPREGIVLHSTKLIQEWNSVDLRTPLSDTLHLPVWVDND 
GNCAALAERKFGQGKGLENFVTLITGTGIGGGIIHQHELIHGSSFCAAELGHLVVSLDG 
PDCSCGSHGCIEAYASGMALQREAKKLHDEDLLLVEGMSVPKDEAVGALHLIQAAKLGN 
AKAQSILRTAGTALGLGVVNILHTMNPSLVILSGVLASHYIHIVKDVIRQQALSSVQDV
DVVVSDLVDPALLGAASMVLDYTTRRIY"

cggcgtctgg aactctattt tagaacctct caaaacgaaa caagcaaatc atggagaaga 
atggaaataa ccgaaagctg cgggtttgtg ttgctacttg taaccgtgca gattattcta 
aacttgcccc gatcatgttt ggcattaaaa ccgaacctga gttctttgaa cttgatgttg 
tggtacttgg ctctcacctg atagatgact atggaaatac atatcgaatg attgaacaag 
Ca * taaca = c aggctacaca caattgtgag gggagaagat gaggcagcca 
tggtggagtc agtaggcctg gccctagtga agctgccaga tgtccttaat cgcctgaagc 
ctgatatcat gattgttcat ggagacaggt ttgatgccct ggctctggcc acatctgctg 
ccttgatgaa catccgaatc cttcacattg aaggtgggga agtcagtggg accattgatg 
actctatcag acatgccata acaaaactgg ctcattatca tgtgtgctgc acccgcagtg 
cagagcagca cctgatatcc atgtgtgagg accatgatcg catccttttg gcaggctgcc 
cttcctatga caaacttctc tcagccaaga acaaagacta catgagcatc attcgcatgt 
ggctaggtga tgatgtaaaa tctaaagatt acattgttgc actacagcac cctgtgacca 
ctgacattaa gcattccata aaaatgtttg aattaacatt ggatgcactt atctcattta 
acaagcggac cctagtcctg tttccaaata ttgacgcagg gagcaaagag atggttcgag 
2£2E2£ faagggcatt gagcatcatc ccaactttcg tgcagttaaa cacgtcccat 900 
ttgaccagtt tatacagttg gttgcccatg ctggctgtat gattgggaac agcagctgtg 960 
gggttcgaga agttggagct tttggaacac ctgtgatcaa cctgigaaca cgtcaga?t| xoll 
gaagagaaac aggggagaat gttcttcatg tccgggatgc tgacacccaa gacaaaatat 
tgcaagcact gcaccttcag tttggtaaac agtacccttg ttcaaagata tatggggatg 
gaaatgctgt tccaaggatt ttgaagtttc tcaaatctat cgatcttcaa gagccactgc 
aaaagaaatt ctgctttcct cctgtgaagg agaatatctc tcaagatatt gaccatattc 
ttgaaactct aagtgccttg gccgttgatc ttggcgggac gaacctccga gttgcaatag 
tcagcatgaa gggtgaaata gttaagaagt atactcagtt caatcctaaa acctatgaag 
llllll I "* aatccta cagatgtgtg tggaagctgc agcagaagct gtaaaactga 
tttgggagta ggcatttcca caggtggccg tgtaaatcct cgggaaggaa 
!^ aac ° aaa ctgatccaag agtggaactc tgtggacctt aggacccccc 
tttctgacac tttgcatctc cctgtgtggg tagacaatga tggcaactgt gctgccctgg 
cggaaaggaa atttggccaa ggaaagggac tggaaaactt tgttacactt atcacaggca 
caggaatcgg tggtggaatt atccatcagc atgaattgat ccacggaagc tccttctgtg 
ctgcagaact gggccacctt gttgtgtctc tggatgggcc tgattgttcc tgtggaagcc 
atgggtgcat tgaagcatac gcctctggaa tggccttgca gagggaggca aaaaagctcc 
atgatgagga cctgctcttg gtggaaggga tgtcagtgcc aaaagatgag gctgtgggtg 
cgctccatct catccaagct gcgaaacttg gcaatgcgaa ggcccagagc atcctaagaa 
cagctggaac agctttgggt cttggggttg tgaacatcct ccataccatg aatccctccc 
ttgtgatcct ctccggagtc ctggccagtc actatatcca cattgtcaaa gacgtcattc 
lllTLl qS< : ctt 9 tcctcc gtgcaggacg tggatgtggt ggtttcggat ttggttgacc 2160 
ccgccctgct gggtgctgcc agcatggttc tggactacac aacacgcagg atctactaga 2220 
Ca9a ! at " a cct tctctcc agagctccgt agtggaatca agttcttgtc 2280 
tttaggagta gcctttctta acaatcaaat ctgtattgaa ctgacggtga ctttggcaga 2340 
gaatgtttca cttttgtctc ctcttccaga gtcaccttcc ccactcta 99 ca 9 a 2340
2388




Homo sapiens carcinoembryonic antigen 2a (CGM2) mRNA, complete cds. 

/translation="MGSPSACPYRVCIPWQGLLLTASLLTFWNLPNSAQTNIDGVPFNV
AEGKEVLLVVHNESQNLYGYNWYKGQRVHANYRIIGYVKNISQENAPGPAHNGRETIYP
NGTLLIQNVTHNDAGFYTLHVIKENLVNEEVTRQFYVFSEPPKPSITSNNFNPVENKDI
VVLTCQPETQNTTYLWWVNNQSLLVSPRLLLSTDNRTLVLLSATKNDIGPYECEIQNPV
GASRSDPVTLNVCYESVQASSPDLSAGTAVSIMIGVLAGMALI"

gccatgggtt ccccttcagc ctgtccatac agagtgtgca ttccctggca ggggctcctg 
ctcacagcct cgcttttaac cttctggaac ctgccaaaca gtgcccagac caatattgat 
ggtgtgccgt tcaatgtcgc agaagggaag gaggtccttc tagtagtcca taatgagtcc 
cagaatcttt atggctacaa ctggtacaaa gggcaaaggg tgcatgccaa ctatcgaatt 
ataggatatg taaaaaatat aagtcaagaa aatgccccag ggcccgcaca caacggtcga 
gagacaatat accccaatgg aaccctgctg atccagaacg tcacccacaa tgacgcagga 
ttctataccc tacacgttat aaaagaaaat cttgtgaatg aagaagtaac cagacaattc 
tacgtattct cggagccacc caagccctcc atcaccagca acaacttcaa tccggtggag 
aacaaagata ttgtggtttt aacctgtcaa cctgagactc agaacacaac ctacctgtgg 
tgggtaaaca atcagagcct cctggtcagt cccaggctgc tgctctccac tgacaacagg 
accctcgttc tactcagcgc cacaaagaat gacataggac cctatgaatg tgaaatacag 
aacccagtag gtgccagccg cagtgaccca gtcaccctga atgtctgcta tgagtcagta 
caagcaagtt cacctgacct ctcagctggg accgctgtca gcatcatgat tggagtactg 
gctgggatgg ctctgatata gcag 

DE yh42a11.r1 Soares placenta Nb2HP Homo sapiens cDNA clone IMAGE:132380 5' 
DE mRNA sequence. 



ggtttttaca agagtaacac atttaaattt acagaggtaa gaatttcctt ggagaaatag 60 

gtgctggtga taataggagt atctttcttt tccatatcaa cataattata ataaataact 120 

cacagattta aaggcttatt ttgtgccagg cattctgctg agtgctttac atacatgtct 180 

catgtaatcc tcccaacagc tctgcaggga caggagttta tgattatctt gattttatag 240 

gaataggtaa tgtaatgctc agagagggtt aaacatctgg gttaggtcac acaggctaat 300 

ccaatactta ggttttaagg ttttgaggac tgggggtgcn gtgggctcca cggcctgtaa 360 

tccccnggca ctttggggga ggcntaggcc gggnccggtc cccggggtcn ggggtcccng 420 

gcccctccgg 430 



Homo sapiens immediate early response 3, transcript variant short, mRNA 
(cDNA clone MGC:5118 IMAGE:3457670), complete cds. 




ctccgctcgg ctcaccatgt gtcactctcg cagctgccac ccgaccatga ccatcctgca 
ggccccgacc ccggccccct ccaccatccc gggaccccgg cggggctccg gtcctgagat 
cttcaccttc gaccctctcc cggagcccgc agcggcccct gccgggcgcc ccagcgcctc 
tcgcgggcac cgaaagcgca gccgcagggt tctctaccct cgagtggtcc ggcgccagct 
gccagtcgag gaaccgaacc cagccaaaag gcttctcttt ctgctgctca ccatcgtctt 
ctgccagatc ctgatggctg aagagggtgt gccggcgccc ctgcctccag ^ggacgcccc 
taacgccgca tccctggcgc ccacccctgt gtcccccgtc ctcgagccct ttaatctgac 
ttcggagccc tcggactacg ctctggacct cagcactttc ctccagcaac acccggccgc 
cttctaactg tgactccccg cactccccaa aaagaatccg aaaaaccaca aagaaacacc 
aggcgtacct ggtgcgcgag agcgtatccc caactgggac ttccgaggca acttgaactc 
agaacactac agcggagacg ccacccggtg cttgaggcgg gaccgaggcg cacagagacc 
gaggcgcata gagaccgagg cacagcccag ctggggctag gcccggtggg aaggagagcg 
tcgttaattt atttcttatt gctcctaatt aatatttata tgtatttatg tacgtcctcc 
taggtgatgg agatgtgtac gtaatattta ttttaactta tgcaagggtg tgagatgttc 
cccctgctgt aaatgcaggt ctcttggtat ttattgagct ttgtgggact ggtggaagca 
ggacacctgg aactgcggca aagtaggaga agaaatgggg aggactcggg tgggggagga 
cgtcccggct gggatgaagt ctggtggtgg gtcgtaagtt taggaggtga ctgcatcctc 
cagcatctca actccgtctg tctactgtgt gagacttcgg cggaccatta ggaatgagat 
ccgtgagatc cttccatctt cttgaagtcg cctttagggt ggctgcgagg tagagggttg 
ggggttggtg ggctgtcacg gagcgactgt cgagatcgcc tagtatgttc tgtgaacaca 
aataaaattg atttactgtc tgcaaaaaaa aaaaaaaa 

DE 7f03b12.x1 NCI_CGAP_CLL1 Homo sapiens cDNA clone IMAGE:3293567 3', mRNA 
DE sequence. 



ggtatttact tagctatgat aaagaataaa aagtcattta aaaaacgcga taggacagat 60 

aacagactca caacgtattt agatttaaac actgctggtc tacgtaacct gttacaaaag 120 

agagcaaaac ctaactgtca gcatagacat taaagctcac cgttgattat agctcagggc 180 

ctgctcagca ttgtttaaaa agggtcactc acagttttgt caaagagtgc tggtgttctc 240 

tatgaactca taaactgttt tatctgaaaa ggtgattttc taagtagtgt aagccatggg 300 

tacatggtgc aaaaagttca tgttctcact cagctggtga gcgaaggatg ggagcagaga 360 

acagagctaa aacccctggt tttcctttcc ccagatgtaa agcctgctag ctggaactca 420 

cagaagattg gaacaaaaag ataggagatg gacacctgng ggactgctcc agcacgaagg 480 

gaagcgatga gcatcacaca gcag 504 



human full-length cDNA 3-PRIME end of clone CS0DAO09YG15 of NEUROBLASTOMA 
of Homo sapiens (human) 

tttttttttt atttytttaw cacttccaat aaactagcat aagttttatt acaacatata 
cagatttgat acagtttaca aaaaaaacta gatttttcaa ctaaataaaa atgtctttta 
ascmvtkvaa gttggcttag agacatggta tttttctttc aaaactgtgt ttctacaatg 
atttctaagg tcccagtctt gcttgtactt gacagtyacc ctcatctaag caacattaag 240 
akctctgata tctttagtaa agaatacaaa accctgtktt tcttaaaawc ctaatgctga 300 
aagayatgtt atagccaatc cagacaaaca tttatattta aacatttata tttaaacaaa 
angyctctct gaacaaatag cctgcbgaga taaatacagt gatttgtttt cctgatagaa 
ctatttagca tgtttaacac attattctgt agtttgggaa taagagtgtt tcttcccttg 
aagaaaacag gtccccttct gaagaataat gctgattacc ccccaaaatc aaaatagacc 
agcaccaaat gaagtattaa tttacaaaca tgaacttaga acttagctct tacttcttga 
agttctacat cccagactta ataaattaac tacaaaatca ggagtttcat cagctacagt 660 
ataatttaaa aatccatttt caactggcag gagtgaggga gaaggtcaat tgcactgatc 720 
accatgaact tcaagaattt catcaaaact tttttcccag cttatatttg ccttcagagg 780 
tgagctgtag attaccatct ctgatgcttt aacatacaat attcttgttg aaatctcttc 840 
aaagagcaca gcatgtaaag cactaaactg tgttcagatc tgaggagtct gcatggaaag 900 
acctgagacc tctctgaaga gccaaaaaca agtggctgtc tcagtgatmc atctattcat 
cctcacaaga catgcattga gcttttttat tcacagattt atgttagtcc ttagacccat 
gccatgtcca gttcagactg tcggctatca ggygtcttct tgtgcactcy tggagtctgt 1080 
aattgwytcg actgtgtgat mtgtrctttg amarwgtctg cgccatgtgc atagtgacat 1140 
cccagcatac gkccmcmcaw tcggcastgc ggctttcccg gwtwctttct gcctkaacca 1200 

g 
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5E 



602288121P1 NIH_MGC_97 Homo sapiens cDNA clone IMAGE:4373861 5', mRNA 
sequence. 



agcgtggctg gtgaggaagc cgtcgggagc cgccgccgcc atctgaggga ggtaccctgg 60 

aaaccacctt ttatcggtgg ggaagtgcag tcgcggtggg cggctctggg ggccagcgaa 120 

acgggaggcc tctaaatctt tagttggggc tgcattgccc tggagccgca ctcttgagtc 180 

cgaggccatc ttttgttgga gaaggcgtcg gcgttggcgt tttcccgagg ttgggctgta 240 

cagtgtctcc gtccgcggaa aaagaagcct ctgaacccgc gccggcccgc agcccccgtg 300 

ccttccggcc gctgctcgcc gtcgccagag gctaggccac gtttccccca gtgccgaggt 360 

gtttctgtga ccctccctcc actcccattc ccttctgaaa gggcacctgc tcttggtgag 420 

aaaagaaatt atagcacgaa gagccagtat cagaagagta tccatcaccc gcagcaaccg 480 

ctcagggaac accatcaaaa aagaacaaaa agggaatatc tggatttcct gggcgaggag 540 

gagcgagtct gctcgggagc tgttccagca ggcgattttt aaatactggt ttctacgccc 600 

tatacaactt ggcttcacat acttttacaa cttaaccttt ttatgatttt aaaaaactgg 660 

tctgttccgg gacttctccg gccgggacac cgggtaacgg aggtctggcc gggctcccgg 720 

cggcccttgg gacctcactt gtgggaacct taacccatcg agacagaaat ccggtgacgg 780 

cgccaagaag ctggaccagg ggcttcggcg tcgaccacac ctgttagagc cggaccatgg 840 

cccgagccgc ggcggggccc cggaggccac aggccaaggc gggcgaggca gcgctcgaaa 900 

acacggtgac cccaagaggg agaagccact agcgcagaag ggaan 945 




Homo sapiens organic anion transporter polypeptide-related protein 1 
(OATPRP1) mRNA, complete cds. 

/translation="MPLHQLGDKPLTFPSPNSAMENGLDHTPPSRRASPGTPLSPGSLR 
S AAHS PLDTS KQ PLCQIjWAEKHGARGTHEWWS AGQS VACGWWAFAPPCIiQVIiNTPKG 
ILFFLCAAAFIjQGMTVNGFINTVITSIjERRYDIjHSYQSGLIASSYDIAACIjCLTFVSYF 
GGSGHKPRWLGWGVLLMGTGSLVFALPHFTAGRYEVEIiDAGVRTCPANPGAVCADSTSG 
LSRYQLVFMLGQFLHGVGATPLYTLGVTYLDENVKSSCSPVYIAIFYTAAILGPAAGYL 
IGGAIOiNIYTEMGRRTELTTESPIiWVGAWWVGFLGSGAAAFFTAVPIIiGYPRQLPGSQR 
YAVMRAAEMHQLKDS SRGEASNPDFGKTIRDLPLS I WLLLKNPTF I LIiCIiAGATEATIj I 
TGMSTFSPKFLESQFSLSASEAATLFGYLVVPAGGGGTFIjGGFFVNKIjRIjRGSAVIKFC 
LFCTWSLLGILVFSLHCPSVPMAGVTASYGGSLLPEGHLl^TAPCNAACSCQPEHYSP 
VCGSDGL^ FSLCHAGCPAATETNVDGQKVYRDCSCI PQNIjS SGFGHATAGKCTSTCQR 
KPLLLVFIFWI FFTFLSS I PALTATLRCVRDPQRSFAIiGIQWIVVRIIiGGI PGPIAFG 
WVIDKACLLWQDQCGQQGSCLWQNSAMSRYILIMGIiLYKVIiGVLFFAIACFLYKPIiSE 

SSDGLETCI>PSQSSAPDSATDSQIiQSSV n 

ggcacgaggc gctgcgcggc gcggcggccg ggccctcgag acggggacgg acacaccagc 
ccctcggata ccacttggcc actcccgctg aggccactcc cactgcgtgg ctgaagcctc 
gaggtcacca ggcggaggcg cggagatgcc cctgcatcag ctgggggaca agccgctcac 
cttccccagc cccaactcag ccatggaaaa cgggcttgac cacaccccac ccagcaggag 
ggcatccccg ggcacacccc tgagccccgg ctccctccgc tccgctgccc atagccccct 
ggacaccagc aagcagcccc tctgccagct ctgggccgag aagcatggcg cccgggggac 
ccatgaggtg cggtacgtct cggccgggca gagcgtggcg tgcggctggt gggccttcgc 
accgccgtgc ctgcaggtcc tcaacacgcc caagggcatc ctgttcttcc tgtgtgcggc 
cgcattcctg caggggatga ctgtgaatgg cttcatcaac acagtcatca cctccctgga 
gcgccgctat gacctgcaca gctaccagag cgggctcatc gccagctcct acgacattgc 
cgcctgcctc tgcctcacct tcgtcagcta cttcgggggc tcagggcaca agccgcgctg 
gctgggctgg ggcgtgctgc ttatgggcac ggggtcgctg gtgttcgcgc tgccccactt 
cacggctggc cgctatgagg tggagttgga cgcgggtgtc aggacgtgcc ctgccaaccc 
cggcgcggtg tgtgcggaca gcacctcggg cctgtcccgc taccagctgg tcttcatgct 
gggccagttc ctgcatggcg tgggtgccac acccctctac acgctgggcg tcacctacct 
ggatgagaac gtcaagtcca gctgctcgcc cgtctacatt gccatcttct acacagcggc 
catcctgggc ccagctgccg gctacctgat tggaggtgcc ctgctgaata tctacacgga 
aatgggccga cggacggagc tgaccaccga gagcccactg tgggtcggcg cctggtgggt 
cggcttcctg ggctctgggg ccgctgcttt cttcaccgcc gttcccatcc ttggttaccc 
tcggcagctg ccaggctccc agcgctacgc ggtcatgaga gcggcggaaa tgcaccagtt 
gaaggacagc agccgtgggg aggcgagcaa cccggacttt gggaaaacca tcagagacct 
gcctctctcc atctggctcc tgctgaagaa ccccacgttc atcctgctct gcctggccgg 
ggccaccgag gccactctca tcaccggcat gtccacgttc agccccaagt tcttggagtc 
ccagttcagc ctgagtgcct cagaagctgc caccttgttt gggtacctgg tggtgccagc 
gggtggtggc ggcaccttcc tgggcggctt ctttgtgaac aagctcaggc tccggggctc 
cgcggtcatc aagttctgcc tgttctgcac cgttgtcagc ctgctgggca tcctcgtctt 
ctcactgcac tgccccagtg tgcccatggc gggcgtcaca gccagctacg gcgggagcct 
cctgcccgaa ggccacctga acctaacggc tccctgcaac gctgcctgca gctgccagcc 
agaacactac agccctgtgt gcggctcgga cggcctcatg tacttctcac tgtgccacgc 
agggtgccct gcagccacgg agacgaatgt ggacggccag aaggtgtacc gagactgtag 
ctgtatccct cagaatcttt cctctggttt tggccatgcc actgcaggga aatgcacttc 
aacttgtcag agaaagcccc tccttctggt tttcatattc gttgtaattt tctttacatt 
cctcagcagc attcctgcac taacggcaac tctacgatgt gtccgtgacc ctcagagatc 
ctttgccctg ggaatccagt ggattgtagt tagaatacta gggggcatcc cggggcccat 
cgccttcggc tgggtgatcg acaaggcctg tctgctgtgg caggaccagt gtggccagca 
gggctcctgc ttggtgtacc agaattcggc catgagccgc tacatactca tcatggggct 
cctgtacaag gtgctgggcg tcctcttctt tgccatagcc tgcttcttat acaagcccct 
gtcggagtct tcagatggcc tggaaacttg tctgcccagc cagtcctcag cccctgacag 
tgccacagat agccagctcc agagcagcgt ctgaccaccg cccgcgccca cccggccacg 
gcgggcactc agcatttcct gatgacagaa cagtgccgtt gggtgatgca atcacacggg 
aacttctatt tgacctgcaa ccttctactt aacctgtggt ttaaagtcgg ctgtgacctc 
ctgtccccag agctgtacgg ccctgcagtg ggtgggagga acttgcataa atatatattt 
atggacacac agtttgcatc agaacgtgtt tatagaatgt gttttatacc cgatcgtgtg 
tggtgtgcgt gaggacaaac tccgcagggg ctgtgaatcc cactgggagg gcggcgggcc 
tgcagcccga ggaaggcttg tgtgtcctca gttaaaactg tgcatatcga aatatatttt 
gttatttaag cctgcgaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 
aaa 



Homo sapiens cDNA: FLJ21243 fis, clone COL01164. 



acaagaatga atgaatgtct ttgtcttaaa ttttgcccat gtgttaaaag atgtaattct 60 
cagaatggga gagaaatgac tacctttgtt cctactcttt tatataatta tccttttagg 120 
gaaagacttg gtcaactcta atatatctag aaggaagact atatctggtg tagactaata 180 
tgagatgttt tagaagagtt aacctgaaca ctttgaggga gagattattc ttgccagcaa 240 
aaagctagcc aggaatgagc ctaccacatt atttgagaat atcaaacctc aggcctgggg 300 
ggttgagggg aagaagatta ccagaagtgc aggaaagaga agtttgagga acacccttgg 360 
cttagcaaca tgtgataatg caaagctgtt ataacctgtt aatcctacgt actatgtgtt 420 
ctgtaccttt acatgttttt aaatttaaga tagtttgtaa gaactgtaca aaaaaatgct 480 
tctggagatt tctttggcag aaatgccttt catctataat ttcatggaga actgctttaa 540 
ttagcctagg tgaaaagtag tcctagcagt gtaaatatgt ataattagag ttttctaatt 600 
tcactgtgag atctctaact tttgagtggc aaacagatca agtcttttgc tcatagactt 660 
ttctgtgggg ttattaaaat gcaaaagctt tatttttttt aataatgcca tactccatta 720 
gtgtcagatg atggtatgga atttgttccc ttgctttccc ccactgttac tgcttcagtt 780 
tatagattgc cagcagagtt cagaaataga gcagggattt acccgttctt tgcttggaca 840 
tcccattttc ttttgtccag acccatgttg gcaatcatgt atgaactgtg ttatacttct 900 
cagtgctttc ttttttcttt ttgataagat ggatatcaaa aatagttgct gtgcaaaagt 960 
tagtagtctt cttcaagaag aaaaccaatt ctttttctaa taatatcctg tgaaattgct 1020 
tcattcattc atttattttt aagccaaatg tcagcagagt gctgctgctt ttatctagta 1080 
attttgatat gtaagtatta atgcattttt aaaagatgtc tacattgaaa catgttcttc 1140 
ccagtgtcct gcttatgatg ctttgttcag attttttgta agagaccagt tagtacactg 1200 
ggggtgtata ttgtgtacat gtgtcatttt agttaggcat tgtaggccaa atgtgattat 1260 
aaatgaagtt gatgaacatt aattttgtta ttagtgagtt ttttgaattg taaatggatt 1320 
tccagtttac cttctgttgt ctacagcttt tttaatttta aggtttgact aattgtatcc 1380 
atctcattgt acagtgtttt agttgcaagc agaaagtaga atttggtata aagcaggtta 1440 
tttctatatt gaaaggagta cagttgaaat tgtagattta agattgttaa aatcatgaca 1500 
attctaactt gtctattcta acctattgtg tacaatctga ttttttaaaa ttgtaaacat 1560 
gtatgatctt ggtttcatgt gtttttgaaa gtgttattgt ttaaaaaatg aaaaaagcat 1620 
atctgctaaa gagctgtcag ttttcattac tgactctgta aaatacactg ttctttgtgt 1680 
actgtgtgtt attttgccag ctgctgcatt agccttcaaa agtatttgga aacttaagat 1740 
gaactacatt tcttgcaaag tacattcctt tctgtggtat tttgtcctgt aactgaagta 1800 
tagtaattat tttatggaaa tgttagcaat tctgtaccaa ctttgaataa aatgaaaaat 1860 
ttaaaaaaaa aaaaaaaaaa 1880 



DE ab38f03.s1 Stratagene HeLa cell s3 937216 Homo sapiens cDNA clone 
DE IMAGE:843101 3' similar to contains Alu repetitive element;, mRNA sequence. 

ttttgagatg aagtctcgcg ctcttgtccc ctaggctgga gtgcaatgat gcgatcttgg 60 

ctaactgcag cttctgcccc ttgggttcaa gtgattctcc tgcctcagcc tcccgagtag 120 

ctgggattac aggcgcctgc caccacgccc ggctaatttt tatattttta gttgagacag 180 

ggtttcacca tgttggccag gctgatctcg aactcctgac ctcaggtgat tcgcccgcct 240 

cagcctccca aagtgctggg attacaggca tgagccaccg tgccggacct atttaaaaat 300 

ctttttgaag tacagtacta ataaactaag gactacctag agatcacact tttagatatt 360 

atctatttta acatagatta aaaatactgt ttatatgaaa attaagctta aatacacgta 420 

taggtaataa ttattttgcc catatacaag taatgtaaac agag 464 



Homo sapiens KPL1 (KPL1) mRNA, complete cds. 

/translation="MALVRGGWLWRQSSILRRWKRNWFALWLDGTLGYYHDETAQDEED 
RVLIHFNVI^IKIGPECTDVQPPEGRSRIMIjLTVNIjIffiGGRIiHLCAETKDDAIiAWKTAL 
LEANSTPAPAGATVPPRSI^VCSKVRCVTOSWSPCKVEP^IV^VYSPYQDyyEWPPN 
AHEATYVRSYYGPPYAGPGVTHVIVREDPCYSAGAPIJ^GMIAGAATGAALGSIJWSPC 

WF" 



aagaaatggc cctggtgagg ggcggctggc tgtggagaca gagctccatc ^cgccgct 
ggaagcggaa ctggtttgcc ctgtggctgg acgggaccct gggatactac cacgatgaga 
cagcgcagga cgaggaggac cgtgtgctca tccacttcaa tgtccgtgac ^taaagatcg 
gcccagagtg ccatgatgtg cagcccccag agggccggag ccgagatggc ctgctgactg 
tgaacctacg ggaaggcggc cgcctgcacc tctgtgcgga gaccaaggat gatgccctag 
catggaagac agcactgctg gaggcaaact ccaccccggc cccagctgga gccaccgtcc 
ctcccaggag ccgccgggtt tgctccaagg tcaggtgtgt gacccgctcg tggagcccct 
gtaaggttga gaggcggatc tgggtgcgcg tctacagccc gtaccaagac tactacgagg 
Lgtgccccc caatgcacac gaggccacgt atgtccgcag ctactacgga ccgccctacg 
caggccctgg cgtgacgcac gtgatagtgc gggaggatcc ctgctacagc gccggcgccc 
ctctggccat gggcatgctt gcgggagccg ccactggggc ggcgctgggc tcgctcatgt 
ggtcgccctg ctggttctga gccctgggac tcggagcact gacccctgcg cttggattgc 
lagactcctc ttcctcctgg accccatcct ctaccatcca agccctgtcc cactttggcc 
ctatcctctc cattagctcc ttccgggttt ggaccattcc ccccactccc tacccttaat 
ccccacatgg gaagaagcta tcatcacagg tacaaacatc gcttgaagtc ttcacatcta 
ccactagaca cccccaaaat ctgttataga catttatgga tacatttcct ctaaacacaa 
cagggcacag caaatacgac ttcatttggc ttcgagttcc ccaggcgctg tagacacaac 
atgaatcggg ctctctgctc tctccttagg gagctcgagt cctggtgggg agaacaggag 
taaaccaagg acttgaacaa agctgaagag ttatcagtcc tttgacaagg acaggtgggg 
cagggagcaa gacaggtagg ctggaagaac agttattggc aagtatgcag agccgtgaac 
gtcatggcat gtccaaggaa ttaaatggga gttcatttgg gctggggtgg aggctgggat 
caaaccgtqg tgggccttca agctaaggag cttcctaggt gaaaggggag atgtgagctt 
aagttcatg altgcatcta taatgaatat attgcctgtt ttgtgaatac 
tgaclcatgt ccatacctaa aacactcctg agttaagtcc catccttccc acaaacagct 
tcctggctgg tacccatgat aacaattgag ctgaacctgg ggacccctgg ttggggaaca 
ggtgagttct atttgagact tccagcccta gaaagctgcc tccgtccaga aatgcctctc 
acaccaggag ctcggccctc tctttgtagc tgtgactgtc accctctcag gctttgtctc 
atccttcatt ctgaataaga tggcagtgtt ctcctctggg gcctgatcca cctctacacc 
agcccaggaa gccccatctg tgcctgccct caggtggtcc accagtctcc c^^tttggtt 
cccttccagt ctcttccccc tttctatccc aatcaccaat agaaatgcta acatccctgc 



ctggtagcca ga 1812 



Homo sapiens carboxypeptidase, vitellogenic-like, transcript variant 2, 
mRNA (cDNA clone MGC:10029 IMAGE:3888647), complete cds. 

/ translation= "IWGATWKVIVSLVIJ^GPmGLFI^ 

TPYIEAGKIQKGRELSLVGPFPGLNMKSYAGFLTVNKTYNSNLFFWFFPAQIQPEDAPV 
VLWLQGGPGGS SMFGL FVEHGPYVVTSNMTLRDRDFP WTTTLSML Y I DNP VGTGFS FTD 
DTHGYAVNEDDVARDIi YS ALI QFFQ I FPE YKNNDFYVTGES YAGKYVPAI AHLIHS LNP 
VREVKINLNGIAIGDGYSDPES I IGGYAEFLYQIGLIiDEKQKKYFQKQCHECIEHIRKQ 
NWFEAFE I IjDKLIjDGDLTSDPS YFQNVTGCSNYYNFLRCTE PEDQL YYVKFL SLPE VRQ 
A IHVGNQTFNDGT I VE KYL REDTVQ S VKP WLTE I MNNYKVL I YNGQLD 1 1 VAAAL TER S 

IiMGMDWKGSQEYKKAEKKVWKIFKSDSEVAGYIRQVGDFHQVIIRGGrGHILPYDQPLiRA 
FDMINRF I YGKGWD P YVG " * 

agcgctgcaa ggacaaccgg ctggggtcct tgcgcgccgc ggctcaggga ggagcaccga 60 
ctgcgccgcg taagtgccgc ctgccctgcg tgggtcgtgc cagctcagcg ggacaggccc 120 
tcgcctcggt ccctcggact tagggagcgc ggggcagacc ctgagagatg y u tyyuycca 180 
tgtggaaggt gattgtttcg ctggtcctgt tgatgcctgg cccctgtgat gyyc tyt t tic 240 
gctccctata cagaagtgtt tccatgccac ctaagggaga ctcaggacag ccatuatttc 300 
tcacccctta cattgaagct gggaagatcc aaaaaggaag agaattgagt uuggncggcc 360 
ctttcccagg actgaacatg aagagttatg ccggcttcct caccgtgaat aagacttaca 420 
acagcaacct cttcttctgg ttcttcccag ctcagataca gccagaagat gccccagtag 480 
ttctctggct acagggtggg ccgggaggtt catccatgtt tggactcttt gtggaacatg 540 
ggccttatgt tgtcacaagt aacatgacct tgcgtgacag agacttcccc tggaccacaa 600 
cgctctccat gctttacatt gacaatccag tgggcacagg cttcagtttt actgatgata 660 
cccacggata tgcagtcaat gaggacgatg tagcacggga tttatacagt gcactaattc 720 
agtttttcca gatatttcct gaatataaaa ataatgactt ttatgtcact ggggagtctt 780 
atgcagggaa atatgtgcca gccattgcac acctcatcca ttccctcaac cctgtgagag 840 
aggtgaagat caacctgaac ggaattgcta ttggagatgg atattctgat cccgaatcaa 900 
ttataggggg ctatgcagaa ttcctgtacc aaattggctt gttggatgag aagcaaaaaa 960 
agtacttcca gaagcagtgc catgaatgca tagaacacat caggaagcag aactggtttg 1020 
aggcctttga aatactggat aaactactag atggcgactt aacaagtgat ccttcttact 1080 
tccagaatgt tacaggatgt agtaattact ataacttttt gcggtgcacg gaacctgagg 1140 
atcagcttta ctatgtgaaa tttttgtcac tcccagaggt gagacaagcc atccacgtgg 1200 
ggaatcagac ttttaatgat ggaactatag ttgaaaagta cttgcgagaa gatacagtac 1260 
agtcagttaa gccatggtta actgaaatca tgaataatta taaggttctg atctacaatg 1320 
gccaactgga catcatcgtg gcagctgccc tgacagagcg ctccttgatg ggcatggact 1380 
ggaaaggatc ccaggaatac aagaaggcag aaaaaaaagt ttggaagatc tttaaatctg 1440 
acagtgaagt ggctggttac atccggcaag tgggtgactt ccatcaggta attattcgag 1500 
gtggaggaca tattttaccc tatgaccagc ctctgagagc ttttgacatg attaatcgat 1560 
tcatttatgg aaaaggatgg gatccttatg ttggataaac taccttccca aaagagaaca 1620 
tcagaggttt tcattgctga aaagaaaatc gtaaaaacag aaaatgtcat aggaataaaa 1680 
aaattatctt ttcatatctg caagattttt ttcatcaata aaaattatcc ttgaaaaaaa 1740 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 1772 



Homo sapiens teratocarcinoma-derived growth factor 1, mRNA (cDNA clone 
MGC:24110 IMAGE:4615416), complete cds. 

/translation="MDCRKMARFSYSVIWIMAISKAFELGLVAGLGHQEFARPSRGYLA
FRDDSIWPQEEPAIRPRSSQRVPPMGIQHSKELNRTCCLNGGTCMLGSFCACPPSFYGR
NCEHDVRKENCGSVPHDTWLPKKCSLCKCWHGQLRCFPQAFLPGCDGLVMDEHLVASRT
PELPPSARTTTFMLVGICLSIQSYY"

agtttcccct ggacgccttg ctcctgcttc tgctacgacc ttctggggaa aacgaatttc 
tcattttctt cttaaattgc cattttcgct ttaggagatg aatgttttcc tttggctgtt 
ttggcaatga ctctgaatta aagcgatgct aacgcctctt ttccccctaa ttgttaaaag 
ctatggactg caggaagatg gcccgcttct cttacagtgt gatttggatc atggccattt 
ctaaagcctt tgaactggga ttagttgccg ggctgggcca tcaggaattt gctcgtccat 
ctcggggata cctggccttc agagatgaca gcatttggcc ccaggaggag cctgcaattc 
ggcctcggtc ttcccagcgt gtgccgccca tggggataca gcacagtaag gagctaaaca 
gaacctgctg cctgaatggg ggaacctgca tgctggggtc cttttgtgcc tgccctccct 
ccttctacgg acggaactgt gagcacgatg tgcgcaaaga gaattgtggg tctgtgcccc 
atgacacctg gctgcccaag aagtgttccc tgtgtaaatg ctggcacggt cagctccgct 
gctttcctca ggcatttcta cccggctgtg atggccttgt gatggatgag cacctcgtgg 
cttccaggac tccagaacta ccaccgtctg cgcgtactac cacttttatg ctagttggca 
tctgcctttc tatacaaagc tactattaat cgacattgac ctatttccag aaatacaatt 
ttagatatca tgcaaatttc atgaccagta aaggctgctg ctacaatgtc ctaactgaaa 
gatgatcatt tgtagttgcc ttaaaataat gaatacattt ccaaaatggt ctctaacatt 
tccttacaga actacttctt acttctttgc cctgccctct cccaaaaaac tacttctttt 
ttcaaaagaa agtcagccat atctccattg tgcctaagtc cagtgtttct tttttttttt 
tttttgagac ggagtctcac tctgtcaccc aggctggact gcaatgacgc gatcttggtt 
cactgcaacc tccgcatccg gggttcaagc cattctcctg cctcagcctc ccaagtaact 
gggattacag gcatgtgtca ccatgcccag ctaatttttt tgtattttta gtagagatgg 
gggtttcacc atattggcca gtctggtctc gaactcctga ccttgtgatc cactcgcctc 
agcctctcga agtgctgaga ttacacacgt gagcaactgt gcaaggcctg gtgtttcttg 
atacatgtaa ttctaccaag gtcttcttaa tatgttcttt taaatgattg aattatatgt 
tcagattatt ggagactaat tctaatgtgg accttagaat acagttttga gtagagttga 
tcaaaatcaa ttaaaatagt ctctttaaaa ggaaagaaaa catctttaag gggaggaacc 1500 
agagtgctga aggaatggaa gtccatctgc gtgtgtgcag ggagactggg taggaaagag 1560 

gaagcaaata gaagagagag gttgaaaaac aaaatgggtt acttgattgg tgattaggtg 
gtggtagaga agcaagtaaa aaggctaaat ggaagggcaa gtttccatca tctatagaaa 
gctatataag acaagaaatc cccttttttt cccaaaggca aaaaaaaaaa aaaaaaaaaa 
aaaaaaaa 1748 



Homo sapiens lipase mRNA, complete cds. 



/translation="MDLDVVNMFVIAGGTLAIPILAFVASFLLWPSALIRIYYWYWRRT
LGMQVRYVHHEDYQFCYSFRGRPGHKPSILMLHGFSAHKDMWLSVVKFLPKNLHLVCVD
MPGHEGTTRSSLDDLSIDGQVKRIHQFVECLKLNKKPFHLVGTSMGGQVAGVYAAYYPS
DVSSLCLVCPAGLQYSTDNQFVQRLKELQGSAAVEKIPLIPSTPEEMSEMLQLCSYVRF
KVPQQILQGLVDVRIPHNNFYRKLFLEIVSEKSRYSLHQNMDKIKVPTQIIWGKQDQVL
DVSGADNVGQVNCQLPGGASGKLWALSSDGKNPGRQPSS"



gccgggtcgg ggcggggcgg cttttctgtc ggaggacgcg aaccggcacg ctgcgccttt 60 

aaggagtccg gctgggctgg gcgccggagc tgggagccgc gcgggtagga gcccggcggc 120 

aggtcccagc ccggggctag agaccgaggg ccggggtccg ggcccggcgg cgggacccag 180 

gcggttgagg ctggtcagaa tctcattttc aggacccagg cggttgaggc tggtcaggag 240 

tcagccagcc tgaaagagca ggatggatct tgatgtggtt aacatgtttg tgattgcggg 300 

cggcacgctg gccatcccaa tcctggcatt tgtggcttca tttcttctgt ggccttcagc 360 

actgataaga atctattatt ggtactggcg gaggacattg ggcatgcaag tccgctatgt 420 

tcaccatgaa gactatcagt tctgttattc cttccggggc aggcctgggc acaaaccctc 480 

catcctcatg ctccacggat tctctgccca caaggatatg tggctcagtg tggtcaagtt 540 

ccttccaaag aacctgcact tggtctgcgt ggacatgcca ggacatgagg gcaccacccg 600 

ctcctccctg gatgacctgt ccatagatgg gcaagttaag aggatacacc agtttgtaga 660 

atgcctgaag ctgaacaaaa aacctttcca cctggtaggc acctccatgg gtggccaggt 720 

ggctggggtg tatgctgctt actacccatc ggatgtctcc agcctgtgtc tcgtgtgtcc 780 

tgctggcctg cagtactcaa ctgacaatca atttgtacaa cggctcaaag aactgcaggg 840 

ctctgccgcc gtggagaaga ttcccttgat cccgtctacc ccagaagaga tgagtgaaat 900 

gcttcagctc tgctcctatg tccgcttcaa ggtgccccag cagatcctgc aaggccttgt 960 

cgatgtccgc atccctcata acaacttcta ccgaaagttg tttttggaaa tcgtcagtga 1020 

gaagtccaga tactctctcc atcagaacat ggacaagatc aaggttccga cgcagatcat 1080 

ctgggggaaa caagaccagg tgctggatgt gtctggggca gacaatgttg gccaagtcaa 1140 

ttgccaactg ccaggtggag cttctggaaa actgtgggca ctcagtagtg atggaaaaaa 1200 

cccaggaaga cagccaagct cataatcgac tttttagctt ctgtgcaaaa cacagacaaa 1260 

caacaagaag ctggacttga aggcccccaa ctgcagcctg gaatttttga acacagcatt 1320 

ctgcttccca ttcccccaag ttttgacgca gccaaccatt tttaagggat tcctgcccca 1380 

aattgcggtt ggaagcgcca attgacccct ggaggaaagc ccgtcccctt attccccggg 1440 

tatccacggt tccccagagc tttggggacc acggcgaaaa cctccaagat a 1491 



Homo sapiens v-fos FBJ murine osteosarcoma viral oncogene homolog, mRNA 
(cDNA clone MGC:11074 IMAGE:3688670), complete cds. 

/translation="MMFSGFNADYEASSSRCSSASPAGDSLSYYHSPADSFSSMGSPVN
AQDFCTDLAVSSANFIPTVTAISTSPDLQWLVQPALVSSVAPSQTRAPHPFGVPAPSAG
AYSRAGVVKTMTGGRAQSIGRRGKVEQLSPEEEEKRRIRRERNKMAAAKCRNRRRELTD
TLQAETDQLEDEKSALQTEIANLLKEKEKLEFILAAHRPACKIPDDLGFPEEMSVASLD
LTGGLPEVATPESEEAFTLPLLNDPEPKPSVEPVKSISSMELKTEPFDDFLFPASSRPS
GSETARSVPDMDLSGSFYAADWEPLHSGSLGMGPMATELEPLCTPVVTCTPSCTAYTSS
FVFTYPEADSFPSCAAAHRKGSSSNEPSSDSLSSPTLLAL"

ccaagactga gccggcggcc gcggcgcagc gaacgagcag tgaccgtgct cctacccagc 
tctgcttcac agcgcccacc tgtctccgcc cctcggcccc tcgcccggct ttgcctaacc 
gccacgatga tgttctcggg cttcaacgca gactacgagg cgtcatcctc ccgctgcagc 
agcgcgtccc cggccgggga tagcctctct tactaccact cacccgcaga ctccttctcc 
agcatgggct cgcctgtcaa cgcgcaggac ttctgcacgg acctggccgt ctccagtgcc 
aacttcattc ccacggtcac tgccatctcg accagtccgg acctgcagtg gctggtgcag 
cccgccctcg tctcctctgt ggccccatcg cagaccagag cccctcaccc tttcggagtc 
cccgccccct ccgctggggc ttactccagg gctggcgttg tgaagaccat gacaggaggc 
cgagcgcaga gcattggcag gaggggcaag gtggaacagt tatctccaga agaagaagag 
aaaaggagaa tccgaaggga aaggaataag atggctgcag ccaaatgccg caaccggagg 
agggagctga ctgatacact ccaagcggag acagaccaac tagaagatga gaagtctgct 
ttgcagaccg agattgccaa cctgctgaag gagaaggaaa aactagagtt catcctggca 
gctcaccgac ctgcctgcaa gatccctgat gacctgggct tcccagaaga gatgtctgtg 
gcttcccttg atctgactgg gggcctgcca gaggttgcca ccccggagtc tgaggaggcc 
ttcaccctgc ctctcctcaa tgaccctgag cccaagccct cagtggaacc tgtcaagagc 
atcagcagca tggagctgaa gaccgagccc tttgatgact tcctgttccc agcatcatcc 
aggcccagtg gctctgagac agcccgctcc gtgccagaca tggacctatc tgggtccttc 
tatgcagcag actgggagcc tctgcacagt ggctccctgg ggatggggcc catggccaca 
gagctggagc ccctgtgcac tccggtggtc acctgtactc ccagctgcac tgcttacacg 
tcttccttcg tcttcaccta ccccgaggct gactccttcc ccagctgtgc agctgcccac 
cgcaagggca gcagcagcaa tgagccttcc tctgactcgc tcagctcacc cacgctgctg 
gccctgtgag ggggcaggga aggggaggca gccggcaccc acaagtgcca ctgcccgagc 
tggtgcatta cagagaggag aaacacatct tccctagagg gttcctgtag acctagggag 
gaccttatct gtgcgtgaaa cacaccaggc tgtgggcctc aaggacttga aagcatccat 
gtgtggactc aagtccttac ctcttccgga gatgtagcaa aacgcatgga gtgtgtattg 
ttcccagtga cacttcagag agctggtagt tagtagcatg ttgagccagg cctgggtctg 
tgtctctttt ctctttctcc ttagtcttct catagcatta actaatctat tgggttcatt 
attggaatta acctggtgct ggatattttc aaattgtatc tagtgcagct gattttaaca 
ataactactg tgttcctggc aatagtgtgt tctgattaga aatgaccaat attatactaa 
gaaaagatac gactttattt tctggtagat agaaataaat agctatatcc atgaaaaaaa 



aaaaaaaaaa aaaa 1814 



Homo sapiens endoplasmic reticulum lumenal Ca2+ binding protein grp78 mRNA, 
complete cds. 



/ translation^ « MKLSLVAAMLLIiLS AARAEEEDKKEDVGTWGIDLGTTYS CVGVF 

KNGRVE 1 I ANDQGNRITP S YVAFTPEGERLIGDAAKNQLTSNPENTVFDAKRL IGRTWN 

DPS VQQD I KFLPFKWEKKTKPYI QVD IGGGQTKTPAPEE I SAMVLTKMKETAEAYLGK 

KVTHAVVTVPAYFNDAQRQATKDAGTIAGLNVMRI INEPTAAAIAYGLDKREGEKWILV 

FDLGGGTFDVSLLTIDNGVFEVVATNGDTHLGGEDFDQRVMEHF I KLYKKKTGKDVRKD 

NRAVQKLRREVEKAKRAL S SQHQARIE IES FYEGEDFSETLTRAKFEELNMDIiFRS TMK 

PVQKVLEDSDLKKSD IDE I VLVGGSTRI PKIQQIiVKEFFNGKEPSRGINPDEAVAYGAA 

VQAGVLSGDQDTGDLVIjLDVCPLTLGIETVGGVMTKL I PRNTWPTKKSQI FS TASDNQ 

PTVTI KVYEGERPLTKDNHIiLGTFDLTGIPPAPRGVPQ IEVTFE IDVNGI LRVTAEDKG 

TGNKNKITITNDQNRLTPEEIERMVNDAEKFAEEDKKLKERIDTRNELESYAYSLKNQI 

GDKEKLGGKLSSEDKETMEKAVEEKIEWLESHQDADIEDFKAKKKELEEIVQPIISKIjY 
GSAGPPPTGEEDTAEKDEL " 



gcgcggcgcg ggccgaggag 60 
acctggggac cacctactcc 120 
ccaacgatca gggcaaccgc 180 
gtctgattgg cgatgccgcc 240 
acgccaagcg gctcatcggc 300 
tcttgccgtt caaggtggtt 360 
gtgggcaaac aaagacattt 420 
aagaaaccgc tgaggcttat 480 
cctattttaa tgatgcccaa 540 
atgttatgag gatcatcaac 600 
gggaggggga gaagaacatc 660 
ttctcaccat tgacaatggt 720 
gtggagaaga ctttgaccag 780 
cgggcaaaga tgtcaggaaa 840 
aggccaaacg ggccctgtct 900 
aaggagaaga cttttctgag 960 
tgttccggtc tactatgaag 1020 
ctgatattga tgaaattgtt 1080 
tggttaaaga gttcttcaat 1140 
tagcgtatgg tgctgctgtc 1200 
tggtactgct tgatgtatgt 1260 
ccaaactgat tccaaggaac 1320 
cttctgataa tcaaccaact 1380 
aagacaatca tcttctgggt 1440 
tcccacagat tgaagtcacc 1500 
aagacaaggg tacagggaac 1560 
cacctgaaga aatcgaaagg 1620 
agctcaagga gcgcattgat 1680 
atcagattgg agataaagaa 1740 
tggaaaaagc tgtagaagaa 1800 
aagacttcaa agctaagaag 1860 
tctatggaag tgcaggccct 1920 
tgtag 1965 



Homo sapiens S100 calcium binding protein A2, mRNA (cDNA clone MGC:3847 
IMAGE:3659591), complete cds. 

/translation="MCSSIiEQAIjAVLVTTFHKySCQEGDKFKXiSKQEM^ELLHKELPSF 



trans lat ion- » MCS SLEQALAVLVTTFHK¥SCQEGDKFKJ-ib 
^E^nJEEGliKKLMGSIiDENSDQQVDFQEYAVFliALITVMCNDFFQGCPDRP 

ctcccctcac cccggtccag gatgcccagt ccccacgaca cctcccactt «cactgtgg 
Sctaaataqq ctclggggct gcccttgacc tggcctagag ccctccccca gctggtggtg 
gagcggcac tctcSSag Igagggggct gggagggaat gagtgggaat ^aagaggc 
?aggg?ttgg tgggatcagg ttgaggcagg tttggtttcc ttaaaatgcc a «9"ggggg 
ccISggggc cclcatataa atcctcaccc tgggagcctg gctgccttgc tctcottcct 
Zallclllll ctgccacctg gtctgccaca gatccatgat gtgcagttct ctggagcagg 

IS SS=S -™ ™- «. 

S S3S S5S =353 » «S 

acttcttcca gggctgccca gaccgaccct 9aagcagaac ^cttgacttc ^gccatgga 
tcttttgggc ccaggactgt tgatgccttt gagttttgta ttcaataaac ttfcttttgtc ^ 
tgttgaaaaa aaaaaaaaaa aaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 



DE wa01c11.x1 NCI_CGAP_Kid11 Homo sapiens cDNA clone IMAGE:2296820 3', mRNA 
DE sequence. 



acttccttca ctagttacga caaaatttaa gaggaataac aaatacaaat tttctgttaa 60 

gaacggaaag gtgcaaacta gcagagtcaa tactggtaac cagaaggcac taatccaaac 120 

acataaattt caaaagctgg ttatattatg gaataccata tatactggcc tttgccagtt 180 

tgggatttct gcaatagcaa taagcctcgt ttctgtttcc aattataaca acaaaaagat 240 

gagttactaa tgaacattcc acttacagaa gtctaggcta tgttgataaa ttgaaaactt 300 

atctagacta ctctgtctaa gagcaataaa aagtaaacac tcttttatcc agcagcacta 360 

ggaaacaggg tgaatttacc aagataaatt aggttgggga tacctactgc caacttgtgc 420 

ggttgtcgaa ttcactgtaa tatgtattcc tcttattgat agagctctga atgtaaacaa 480 
ccta 484 



Human 150 kDa oxygen-regulated protein ORP150 mRNA, complete cds. 

/ translations « MADKVRRQRPR^VCWAIiVAVLI^LLALSDTLAVMSVDLGSESM 

K^AIVKPGVPMEIVLNKESRRKTPVIVTLKENERFFGDSAASMAIKNPKATLRYFQHLL 

GKQADNPHVAIiYQARFPEHEliTFDPQRQT\7HFQlSSQLQFSPEEVLGMVl.NYSRSLAED 

FAEQPIKDAVITVPVFFNQAERRAVLQAARMAGIoK\n^LINDNTATALSYGVFRRKDIN 

TTAQNIMFYDMGSGSTVCTIVTYQMVKTKEAGMQPQLQIRGVGFDRTLGGLEMELRIiRE 

RIAGLFNEQRKGQRAKDWENPRAMAKLLREA^KTVLSANADHMAQIEGLMDDVDFK 

AKVTRVEFEELCADLFERVPGPVQQALQSAEMSLDEIEQVILVGGATRVPRVQEVLLKA 

VGKEELGKNINADEAAAMGAVYQAAALSKAFKVKPFVVRDAWYPILVEFTREVEEEPG 

IHSLKHNKRVLFSRMGPYPQRKVITFNRYSHDFNFHINYGDLGFLGPEDLRVFGSQNLT 

TVKLKGVGDSFKKYPDYESKGIKAHFNLDESGVLSLDRVESVFETLVEDSAEEESTLTK 

LGNTISSLFGGGTTPDAKENGTDTVQEEEESPAEGSKDEPGEQVELKEEAEAPVEDGSQ 

PPPPEPKGDATPEGEKATEKENGDKSEAQKPSEKAEAGPEGVAPAPEGEKKQKPARKRR 

MVEEIGVELVVUJIiPDLPEDKLAQSVQKLQDLTLRDLEKQEREKAANSLEAFIFETQDK 

LYQPEYQEVSTEEQREEISGKI.SAASTWLEDEGVGATTVMIiKEKIAELRKIiCQGI.FFRV 

EERKKWPERLSALDNLLNHSSMFLKGARLIPEMDQIFTEVEMTTLEKVIl^TWAWKNAT 

lAEQAKLPATEKPVLLSKDIEAKMMALDREVQYLLNKAKFTKPRPRPKDKNGTRAEPPLi 



QKRPLKNDEL" 

ttgtgaaggg cgcgggtggg gggcgctgcc ggcctcgtgg gtacgttcgt gccgcgtctg 
tcccagagct ggggccgcag gagcggaggc aagaggggca ctatggcaga caaagttagg 
aggcagaggc cgaggaggcg agtctgttgg gccttggtgg ctgtgctctt 39 c *3acctg 
ttggcactga gtgatacact ggcagtgatg tctgtggacc tgggcagtga gtccatgaag 
gtagccattg tcaaacctgg agtgcccatg gaaattgtct tgaataagga atctcggagg 
aaaacaccgg tgatcgtgac cctgaaagaa aatgaaagat tctttggaga cagtgcagca 
agcatggcga ttaagaatcc aaaggctacg ctacgttact tccagcacct cctggggaag 
caggcagata acccccatgt agctctttac caggcccgct tcccggagca cgagctgact 
ttcgacccac agaggcagac tgtgcacttt cagatcagct cgcagctgca gttctcacct 
gaggaagtgt tgggcatggt tctcaattat tctcgttctc tagctgaaga ttttgcagag 
cagcccatca aggatgcagt gatcaccgtg ccagtcttct tcaaccaggc cgagcgccga 
gctgtgctgc aggctgctcg tatggctggc ctcaaagtgc tgcagctcat caatgacaac
accgccactg ccctcagcta tggtgtcttc cgccggaaag atattaacac cactgcccag 
aatatcatgt tctatgacat gggctcaggc agcaccgtat gcaccattgt gacctaccag 
atggtgaaga ctaaggaagc tgggatgcag ccacagctgc agatccgggg agtaggattc 
gaccgtaccc tggggggcct ggagatggag ctccggcttc gagaacgcct ggctgggctt
ttcaatgagc agcgcaaggg tcagagagca aaggatgtgc gggagaaccc gcgtgccatg 
gccaagctgc tgcgtgaggc taatcggctc aaaaccgtcc tcagtgccaa cgctgaccac 
atggcacaga ttgaaggcct gatggatgat gtggacttca aggcaaaagt gactcgtgtg 
gaatttgagg agttgtgtgc agacttgttt gagcgggtgc ctgggcctgt acagcaggcc 
ctccagagtg ccgaaatgag tctggatgag attgagcagg tgatcctggt gggtggggcc 
actcgggtcc ccagagttca ggaggtgctg ctgaaggccg tgggcaagga ggagctgggg 
aagaacatca atgcagatga agcagccgcc atgggggcag tgtaccaggc agctgcgctc 
agcaaagcct ttaaagtgaa gccatttgtc gtccgagatg cagtggtcta ccccatcctg 
gtggagttca cgagggaggt ggaggaggag cctgggattc acagcctgaa gcacaataaa 
cgggtactct tctctcggat ggggccctac cctcaacgca aagtcatcac ctttaaccgc 
tacagccatg atttcaactt ccacatcaac tacggcgacc tgggcttcct ggggcctgaa 
gatcttcggg tatttggctc ccagaatctg accacagtga agctaaaagg ggtgggtgac 
agcttcaaga agtatcctga ctacgagtcc aagggcatca aggctcactt caacctggat 
gagagtggcg tgctcagtct agacagggtg gagtctgtat ttgagacact ggtagaggac 
agcgcagaag aggaatctac tctcaccaaa cttggcaaca ccatttccag cctgtttgga 
ggcggtacca caccagatgc caaggagaat ggtactgata ctgtccagga ggaagaggag 
agccctgcag aggggagcaa ggacgagcct ggggagcagg tggagctcaa ggaggaagct 
gaggccccag tggaggatgg ctctcagccc ccaccccctg aacctaaggg agatgcaacc 
cctgagggag aaaaggccac agaaaaagaa aatggggaca agtctgaggc ccagaaacca 
agtgagaagg cagaggcagg gcctgagggc gtcgctccag ccccagaggg agagaagaag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 



tac 



ggttgttctg 


2220 


tcaggacttg 


2280 


ggaagcgttc 


2340 


cacagaggag 


2400 


ggatgagggt 


2460 


gctgtgccaa 


2520 


tgccctcgat 


2580 


agagatggac 


2640 


gacctgggcc 


2700 


gcctgtgttg 


2760 


gtatctgctc 


2820 


gacccgggca 


2880 


tccagcaggc 


2940


atccgagcca 


3000 


gaaagaacaa 


3060 


tctgttttcc 


3120 


gggttggggg 


3180 


ggaggtgtgg 


3240 


ggttaaaaat 


3300 


ccctaaattt 


3360 


aatggctgcc 


3420 


ttccactctt 


3480 


tcctttcagg 


3540 


tgtgctcttt 


3600 


gcagctcctt 


3660 


ccttgcctaa 


3720 


gtatttccca 


3780 


cctctccgaa 


3840 


tctgtcataa 


3900 


tcctcctctc 


3960 


ccctgggctg 


4020 


ggagagacca 


4080 


atgaaggagg 


4140 


cccgacccac 


4200 


gctagggctg 


4260 


cttttccttt 


4320 


accccacccc 


4380 


acaggtttga 


4440 


tattcttttc 


4500 




4503 




Homo sapiens S-CaBP1 (CABP1) mRNA, complete cds.

/translation="MGNCVKYPLRNLSRKDRSLRPEEIEELREAFREFDKDKDGYINCR
DLGNCMRTMGYMPTEMELIELSQQINMNLGGHVDFDDFVELMGPKLLAETADMIGVKEL
RDAFREFDTNGDGEISTSELREAMRKLLGHQVGHRDIEEIIRDVDLNGDGRVDFEEFVR
MMSR"

aagtccctca gtcccccagg agcctccttc atggacccgg ggatcccaag aggggctgcc 
tcaacttagg atgggcaact gtgtcaagta tccactgaga aatctctcaa ggaaggatag 
atcactgcga ccagaggaaa ttgaagagct ccgagaggcc ttcagagaat tcgacaagga 
caaggatggc tacatcaact gccgggatct gggcaactgc atgcgcacca tgggctacat 
gcccaccgag atggagctca tcgaactgtc ccagcagatc aacatgaacc tgggtggcca 
tgtagatttt gatgacttcg tggagctaat ggggcctaaa ctcctggcag agacagcaga 
tatgattggt gtaaaggaac tgcgagatgc tttccgagag tttgacacca atggtgatgg 
ggaaataagc accagtgagc tgcgagaggc tatgaggaag ctcctgggtc atcaggtggg 
acaccgagac atagaggaaa ttatccgaga tgtggacctc aatggggatg gacgagtgga 
ctttgaagag tttgtccgga tgatgtcccg ctgaggccgc gagggcccct ccaggactgc
caagctccca aaggcggggc taagaggagc tagagcttgc ctcacccgct gtagccgccg 
agagcccagg atgtactggc ggatggggcc tgcctgcacc ccggggcgga attc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
714 



DE Homo sapiens cDNA FLJ12397 fis, clone MAMMA1002769, weakly similar to Homo
DE sapiens cell cycle progression restoration 8 protein (CPR8) mRNA.



FT /translation="MSENSSDSDSSCGWTVISHEGSDIEMLNSVTPTDSCEPAPECSSL
FT EQEELQALQIEQGESSQNGTVLMEETAYPALEETSSTIEAEEQKIPEDSIYIGTASDDS
FT DIVTLEPPKLEEIGNQEVVIVEEAQSSEDFNMGSSSSSQYTFCQPETVFSSQPSDDESS
FT SDETSNQPSPAFRRRRARKKTVSASESEDRLVAEQETEPSKELSKRQFSSGLNKCVILA
FT LVIAISMGFGHFYGTIQIQKRQQLVRKIHEDELNDMKDYLSQCQQEQGSFIDYKSLKEN
FT LARCWTLTEAEKMSFETQKTNLATENQYLRKLFTDFVNDVKDYLRNMKEYEVDNDGVFE
FT KLDEYIYRHFFGHTFSPPYGPSRPDKKQRMVNIENSRHRKQEQKHLQPQPYKREGKWHK
FT YGRTNGRQMANLEIELGQLPFDPQY"

ataagaggcg tcattggcgc ccgagctgtg accgccgcca ctggggcagc cagcacaatc 60 

gggcggaggt ggcgctgccc cttcagacct gaaagatgtc tgaaaattcc agtgacagtg 120

attcatcttg tggttggact gtcatcagtc atgaggggtc agatatagaa atgttgaatt 180 

ctgtgacccc cactgacagc tgtgagcccg ccccagaatg ttcatcttta gagcaagagg 240 

agcttcaagc attgcagata gagcaaggag aaagcagcca aaatggcaca gtgcttatgg 300 

aagaaactgc ttatccagct ttggaggaaa ccagctcaac aattgaggca gaggaacaaa 360 

agatacccga agacagtatc tatattggaa ctgccagtga tgattctgat attgttaccc 420 

ttgagccacc taagttagaa gaaattggaa atcaagaagt tgtcattgtt gaagaagcac 480 

agagttcaga agactttaac atgggctctt cctctagcag ccagtatact ttctgtcagc 540 

cagaaactgt attttcatct cagcctagtg acgatgaatc aagtagtgat gaaaccagta 600 

atcagcccag tcctgccttt agacgacgcc gtgctaggaa gaagaccgtt tctgcttcag 660 

aatctgaaga ccggctagtt gctgaacaag aaactgaacc ttctaaggag ttgagtaaac 720 

gtcagttcag tagtggtctc aataagtgtg ttatacttgc tttggtgatt gcaatcagca 780 

tgggatttgg ccatttctat ggcacaattc agattcagaa gcgtcaacag ttagtcagaa 840

agatacatga agatgaattg aatgatatga aggattatct ttcccagtgt caacaggaac 900 

aaggatcttt tatagattat aagtcattga aagaaaatct tgcaaggtgt tggacactta 960 

ctgaagcaga gaagatgtcc tttgaaactc agaaaacgaa ccttgctaca gaaaatcagt 1020 

atttaagaaa gctcttcact gactttgtta atgatgttaa agattatctt agaaacatga 1080

aggaatatga agtagataat gatggagtat ttgagaagtt ggatgaatat atatatagac 1140 

acttctttgg tcacactttt tcccctccat atggacccag tcgacctgat aaaaagcaac 1200 

gtatggtaaa tattgaaaac tccaggcatc gaaaacaaga gcagaagcac cttcagccac 1260 

agccttataa aagggaaggt aaatggcata aatatggtcg cactaatgga agacaaatgg 1320 

caaatcttga aatagaattg gggcaattac cttttgatcc tcaatactga ttcacaattg 1380 

agttaaatta gacaactgta agagaaaaat ttatgctttg tataatgttt ggtattgaaa 1440 

ctaatgaaat taccaagatg acaatgtctt ttcttttgtt tctaagtatc agtttgataa 1500 

ctttatatta ttcctcagaa gcattagtta aaagtctact aacctgcatt ttcctgtagt 1560 

ttagcttcgt tgaatttttt ttgacactgg aaatgttcaa ctgtagtttt attaaggaag 1620 

ccaggcatgc aacagatttt gtgcatgaaa tgagacttcc tttcagtgta agagcttaaa 1680 

gcaagctcag tcatacatga caaagtgtaa ttaacactga tgtttgtgtt aaatttgcag 1740 

cagagcttga gaaaagtaca ttgttctgga atttcatcat taacatttta taatcttaca 1800 

ctcacttctt gtctttttgt gggttcagga gccctctgac ttgtgaagaa tttgctgccc 1860

tcttaagagc ttgctgactt gttttcttgt gaaatttttt gcacatctga atatcgtgga 1920 

agaaacaata aaactacacc atgag 1945 



hn58g08.x1 NCI_CGAP_Kid11 Homo sapiens cDNA clone IMAGE:3032126 3', mRNA
sequence.

cattgcttta cgtagatagt aaactatgca tagtatttta tttgtaaccc catgtgttaa 
gaagggacac tgttaaagta acaatcattt aaaagtaaca accaacaaac tggtatttaa
tttggtattt taaatagtta aaaatcaaat ggaaacagtg tctaaagtca ctaagataat 
tcataacaaa acccattaat ccaagctcca cttattgtaa atagaattca ccatgagcta 
acctaaaatg tacctgtgga gataaaacaa gagtgtaagt tagcaaagta ttaaataaaa 
tttcagggag cccctaaatt tatttttaag aactttagaa ctaattctct atatgcaaac 
actgattaac tcaaatatct tgtaagttca ttcatacatg gccttatttg aggcagtgta 
tttgtattca ctagcaaaat tcatgtcagt aaaatatttt tgaagcagtt tatttcccag 
atatttcact agtttgaaat agtcatttca gtgattagtc tgaatttcta ttgaagccta 
agctttg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
547 



DE Homo sapiens cDNA FLJ13465 fis, clone PLACE1003493, weakly similar to
DE ENDOTHELIAL CELL MULTIMERIN PRECURSOR.



FT /translation="MILSLLFSLGGPLGWGLLGAWAQASSTSLSDLQSSRTPGVWKAEA

' T EDTSKDPVGRNWCPYPMSKLVTLLALCKTEKP^ 

PVYQVKQKVLTSLAWRCCPGYTGPNCEHHDSMAI PEPADPGDSHQEPQDGP VS FKPGHL 
AAVINEVEVQQEQQEHLLGDLQNDVHRVADSLPGLW^ 

RSLEQVLLPHVDTFLQVHFSPIWRSFNQSLHSLTQAIRNLSLDVEANRQAISRVQDSAV 
ARADFQELGAKFEAKVQENTQRVGQLRQDVEDRLHAQHFTLHRS I SELQADVDTKLKRL 
HKAQEAPGTNGSLVLATPGAGARPEPDSLQARLGQLQRNLSELHMTTARREEELQYTLE 
DMRATLTRHVDEIKELYSESDETFDQISKVERQVEELQVWHTALRELRVILMEKSLIME 
?T ENKEEVERQLLELNLTLQHLQGGHADLIKYViaDCNCQKXiy^ 

1 TQVSLDERRQLDGSSLQALQNAVDAVSLAVDAHKAEGERARAATSRLRSQVQAIiDDE^ 

ALKAAAAEARHEVRQI^SAFAALLEDAIiRHEAVIAALFGEEVLEEMSEQTPGPLPLSYE 
QIRVALQDAASGLQEQALGWDEIAARVTALEQASEPPRPAEHLEPSHDAGREEAATTAL 
AGLARELQSLSNDVKNVGRCCEAEAGAGAASLNASLDGLHNALFATQRSLEQHQRLFHS 
LFGNFQGLMEANVSLDLGKLQl^njSRKGKKQQKDLEAPRKRDKKEAEPLVD 
GALGAALWEAGS PVAFYAS FSEGTAALQTVKFNTTY INIGS S YFPEHGYFRAPERGVYL 

FAVSVEFGPGPGTGQLVFGGHHRTPVCTTGQGSGSTATVFAMAELQKGERVWFELTQGS 
FT ITKRSLSGTAFGGFLMFKT"






agctgagcct 


60 


ctgggctggg 


120 


cagagctcca 


180 


gttggacgta 


240 


aaaacagaga 


300 


cagaaagtca 


360 


gtgctgacct 


420 


cacgattcca 


480 


gatggaccag 


540 


caacaggaac 


600 


gacagcctgc 


660 


gcaaatcaaa 


720 


gtggacacct 


780 


cacagcctta 


840 


atctccagag 


900 


tttgaggcca 


960 


gaccgcctgc 


1020 


gtggacacca 


1080 


ctggtgttgg 


1140 


ctgggccagc 


1200 


gagttgcagt 


1260 


aaggaactgt 


1320 


gtggaggagc 


1380 


aagtctctga 


1440 


ctcacgctgc 


1500 


aattgccaga 


1560 


cgtgccctgg 


1620 


ctgcaggccc 


1680 


gagggcgagc 


1740 


gacgaggtgg 


1800 


cacagcgcct 


1860 


ctcttcgggg 


1920 


agctacgagc 


1980 


ctcggctggg 


2040 


cggccggcag 


2100 


gccctggccg 


2160 



ggctggcgcg 
aggccgaggc 
cactcttcgc 
ggaacttcca 
tgctgagcag 
agaaggaagc 
gcgcggcgct 
cggctgccct 
tccctgaaca 
ttgaatttgg 
cagtctgtac 
tgcagaaggg 
tgtcgggcac 
gatcagacat 
tggaggtcat 
tacgtggctt 
aatgcttctc 
tcagacagca 
gtgttttgct 
tcctcttctc 
aaaatggagg 
aacctgcttg 
gaaattagta 
ctaaaatatc 
gaggtaattg 
tctgatgatt 
tgaagatgcc 
cagccatgtg 



ggagctccag 
cggggccggg 
cactcagcgc 
agggctcatg 
gaaagggaag 
ggagcctttg
ctgggaggca 
gcagacagtg 
tggctacttc 
cccagggcca 
cactgggcag 
tgagcgagta 
tgcatttggg 
catggactcg 
tcagttggtc 
ccctgtaacc 
cgggcaggac 
cggcctgggc 
cctcctgtgg 
ttctccctta 
caacattttg 
gtctctatta 
tctttagttt 
atcttgaatt 
gatcttgggg 
ttataagttt 
ttggttcctc 
gaacagtgag 



agcctgagca 
gccgcctccc 
agcttggagc 
gaagccaacg 
aagcagcaga 
gtggacatac 
ggatcccctg 
aagttcaaca 
cgagcccctg 
ggcaccgggc 
gggagtggaa 
tggtttgagt 
ggcttcctga 
cccagctctc 
tgtctcttcc 
acatggggct 
atggtcctag 
tccaactctt 
ttggaaactt 
tcgtatgata 
gccaacattg 
gtcagtaatg 
cagtcattcc 
gtaatcccta 
gcggttcccc 
gatagttcct 
ttcactgtct 
tcaattaaac 



acgacgtcaa 
tcaacgcctc 
agcaccagcg 
tcagcctgga 
aagacctgga 
gggtcacagg 
tggccttcta 
ccacatacat 
agcgtggtgt 
agctggtgtt 
gcacagcaac 
taacccaggg 
tgtttaagac 
ctcggcctgg 
ctggaaacct 
tggccatttc 
gaagcctgaa 
caccacaccc 
ctgtacaaca 
gaaagacatt 
gaaagcacta 
aagacgacag 
ttgtaggata 
taatccccac 
catgctgttc 
cctgtgttca 
gccatgattg 
ctctttcctt 



gaatgtcggg 
ccttgacggc 
gctcttccac 
cctggggaag 
agctccccgg 
gcctgtgcca 
tgccagcttt 
caacattggc 
ctacctgttt 
tggaggtcac 
ggtctttgcc 
atcaataaca 



ctgaacccca 
ggctctggcc 
tctgcaaaga 
tccatgatga 
ccttggcttg 
tgtattctac 
ctttaaactt 



cttccccagg 
gagggcaatg 
cctggccaac 
tggtttagct 
atcaagggag 
ttgtgatagt 
ttctccttcc 
taagtttcct 
tataaatt 



cggtgctgcg 
ctccacaacg 
agcctctttg 
ctgcagacca 
aagagggaca 
ggtgccttgg 
tcagaaggga 
agcagctact 
gcagtgagcg 
catcggactc 
atggctgagc 
aagagaagcc 
gccccaatct 
aaggatgggc 
tggtgtggtg 
gaaggactgg 
gcatgccttc 
aacttctttg 
ttctcttgct 
aggaatgttt 
ggattaaacc 
caagggaaag 
gtgcccccac 
agatcaggtg 
tctcacgaga 
tgccaccttg 
gaggcctccc 



2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3828 



Homo sapiens heat shock 27kDa protein 1, mRNA (cDNA clone MGC:8509
IMAGE:2822325), complete cds.

/translation="MTERRVPFSLLRGPSWDPFRDWYPHSRLFDQAFGLPRLPEEWSQW
LGGSSWPGYVRPLPPAAIESPAVAAPAYSRALSRQLSSGVSEIRHTADRWRVSLDVNHF
APDELTVKTKDGVVEITGKHEERQDEHGYISRCFTRKYTLPPGVDPTQVSSSLSPEGTL
TVEAPMPKLATQSNEITIPVTFESRAQLGGPEAAKSDETAAK"

ccgcctgcta aaaatacccg actggaggag cataaaagcg cagccgagcc cagcgccccg 
cacttttctg agcagacgtc cagagcagag tcagccagca tgaccgagcg ccgcgtcccc 
ttctcgctcc tgcggggccc cagctgggac cccttccgcg actggtaccc gcatagccgc 
ctcttcgacc aggccttcgg gctgccccgg ctgccggagg agtggtcgca gtggttaggc 
ggcagcagct ggccaggcta cgtgcgcccc ctgccccccg ccgccatcga gagccccgca 
gtggccgcgc ccgcctacag ccgcgcgctc agccggcaac tcagcagcgg ggtctcggag 
atccggcaca ctgcggaccg ctggcgcgtg tccctggatg tcaaccactt cgccccggac 
gagctgacgg tcaagaccaa ggatggcgtg gtggagatca ccggcaagca cgaggagcgg 
caggacgagc atggctacat ctcccggtgc ttcacgcgga aatacacgct gccccccggt 
gtggacccca cccaagtttc ctcctccctg tcccctgagg gcacactgac cgtggaggcc 
cccatgccca agctagccac gcagtccaac gagatcacca tcccagtcac cttcgagtcg 
cgggcccagc ttgggggccc agaagctgca aaatccgatg agactgccgc caagtaaagc 
cttagcccgg atgcccaccc ctgctgccgc cactggctgt gcctcccccg ccacctgtgt 
gttcttttga tacatttatc ttctgttttt ctcaaataaa gttcaaagca ccccccaaaa 
aaaaaaaaaa aaaaaaaaaa aaaaaaa 



60 
120
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
867 



Homo sapiens carcinoembryonic antigen (CGM2) mRNA, complete cds.

carcinoembryonic antigen. 

/translation="MGSPSACPYRVCIPWQGLLLTASLLTFWNLPNSAQTNIDVVPFNV
AEGKEVLLVVHNESQNLYGYNWYKGERVHANYRIIGYVKNISQENAPGPAHNGRETIYP
NGTLLIQNVTHNDAGFYTLHVIKENLVNEEVTRQFYVFSEPPKPSITSNNFNPVENKDI
VVLTCQPETQNTTYLWWVNNQSLLVSPRLLLSTDNRTLVLLSATKNDIGPYECEIQNPV
GASRSDPVTLNVRYESVQASSPDLSAGTAVSIMIGVLAGMALI"

60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
811 



ccatgggttc cccttcagcc tgtccataca gagtgtgcat tccctggcag gggctcctgc 
tcacagcctc gcttttaacc ttctggaacc tgccaaacag tgcccagacc aatattgatg 
tcgtgccgtt caatgtcgca gaagggaagg aggtccttct agtagtccat aatgagtccc 
agaatcttta tggctacaac tggtacaaag gggaaagggt gcatgccaac tatcgaatta 
taggatatgt aaaaaatata agtcaagaaa atgccccagg gcccgcacac aacggtcgag 
agacaatata ccccaatgga accctgctga tccagaacgt tacccacaat gacgcaggat 
tctataccct acacgttata aaagaaaatc ttgtgaatga agaagtaacc agacaattct 
acgtattctc ggagccaccc aagccctcca tcaccagcaa caacttcaat ccggtggaga 
acaaagatat tgtggtttta acctgtcaac ctgagactca gaacacaacc tacctgtggt 
gggtaaacaa tcagagcctc ctggtcagtc ccaggctgct gctctccact gacaacagga 
ccctcgttct actcagcgcc acaaagaatg acataggacc ctatgaatgt gaaatacaga 
acccagtggg tgccagccgc agtgacccag tcaccctgaa tgtccgctat gagtcagtac 
aagcaagttc acctgacctc tcagctggga ccgctgtcag catcatgatt ggagtactgg
ctgggatggc tctgatatag cagccttggt g 



DE Homo sapiens keratin 7, mRNA (cDNA clone MGC:3625 IMAGE:3610347), complete
DE cds.



FT /translation="MSIHFSSPVFTSRSAAFSGRGAQVRLSSARPGGLGSSSLYGLGAS
FT RPRVAVRSAYGGPVGAGIREVTINQSLLAPLRLDADPSLQRVRQEESEQIKTLNNKFAS
FT FIDKVRFLEQQNKLLETKWTLLQEQKSAKSSRLPDIFEAQIAGLRGQLEALQVDGGRLE
FT AELRSMQDVVEDFKNKYEDEINRRTAAENEFVVLKKDVDAAYMSKVELEAKVDALNDEI
FT NFLRTLNETELTELQSQISDTSVVLSMDNSRSLDLDGIIAEVKAQYEEMAKCSRAEAEA
FT WYQTKFETLQAQAGKHGDDLRNTRNEISEMNRAIQRLQAEIDNIKNQRAKLEAAIAEAE
FT ERGELALKDARAKQEELEAALQRAKQDMARQLREYQELMSVKLALDIEIATYRKLLEGE
FT ESRLAGDGVGAVNISVMNSTGGSSSGGGIGLTLGGTMGSNALSFSSSAGPGLLKAYSIR
FT TASASRRSARD"



'/ 



ctcctcctcg cccgccgcta ggtccatccc ggcccagcca ccatgtccat ccacttcagc 60 

tccccggtat tcacctcgcg ctcagccgcc ttctcgggcc gcggcgccca ggtgcgcctg 120 

agctccgctc gccccggcgg ccttggcagc agcagcctct acggcctcgg cgcctcgcgg 180 

ccgcgcgtgg ccgtgcgctc tgcctatggg ggcccggtgg gcgccggcat ccgcgaggtc 240 

accattaacc agagcctgct ggccccgctg cggctggacg ccgacccctc cctccagcgg 300

gtgcgccagg aggagagcga gcagatcaag accctcaaca acaagtttgc ctccttcatc 360 

gacaaggtgc ggtttctgga gcagcagaac aagctgctgg agaccaagtg gacgctgctg 420 

caggagcaga agtcggccaa gagcagccgc ctcccagaca tctttgaggc ccagattgct 480 

ggccttcggg gtcagcttga ggcactgcag gtggatgggg gccgcctgga ggcggagctg 540 

cggagcatgc aggatgtggt ggaggacttc aagaataagt acgaagatga aattaaccgc 600 

cgcacagctg ctgagaatga gtttgtggtg ctgaagaagg atgtggatgc tgcctacatg 660 

agcaaggtgg agctggaggc caaggtggat gccctgaatg atgagatcaa cttcctcagg 720 

accctcaatg agacggagtt gacagagctg cagtcccaga tctccgacac atctgtggtg 780 

ctgtccatgg acaacagtcg ctccctggac ctggacggca tcatcgctga ggtcaaggca 840 

cagtatgagg agatggccaa atgcagccgg gctgaggctg aagcctggta ccagaccaag 900 

tttgagaccc tccaggccca ggctgggaag catggggacg acctccggaa tacccggaat 960 

gagatttcag agatgaaccg ggccatccag aggctgcagg ctgagatcga caacatcaag 1020 

aaccagcgtg ccaagttgga ggccgccatt gccgaggctg aggagcgtgg ggagctggcg 1080 

ctcaaggatg ctcgtgccaa gcaggaggag ctggaagccg ccctgcagcg ggccaagcag 1140 

gatatggcac ggcagctgcg tgagtaccag gaactcatga gcgtgaagct ggccctggac 1200

atcgagatcg ccacctaccg caagctgctg gagggcgagg agagccggtt ggctggagat 1260 

ggagtgggag ccgtgaatat ctctgtgatg aattccactg gtggcagtag cagtggcggt 1320 

ggcattgggc tgaccctcgg gggaaccatg ggcagcaatg ccctgagctt ctccagcagt 1380 

gcgggtcctg ggctcctgaa ggcttattcc atccggaccg catccgccag tcgcaggagt 1440 

gcccgcgact gagccgcctc ccaccactcc actcctccag ccaccaccca caatcacaag 1500 

aagattccca cccctgcctc ccatgcctgg tcccaagaca gtgagacagt ctggaaagtg 1560 

atgtcagaat agcttccaat aaagcagcct cattctgagg cctgagtgat ccacgtgaaa 1620 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 1668



Homo sapiens hxCT mRNA for cystine/glutamate exchanger, complete cds. 

/translation="MVRKPVVSTISKGGYLQGNVNGRLPSLGNKEPPGQEKVQLKRKVT
LLRGVSIIIGTIIGAGIFISPKGVLQNTGSVGMSLTIWTVCGVLSLFGALSYAELGTTI
KKSGGHYTYILEVFGPLPAFVRVWVELLIIRPAATAVISLAFGRYILEPFFIQCEIPEL
AIKLITAVGITVVMVLNSMSVSWSARIQIFLTFCKLTAILIIIVPGVMQLIKGQTQNFK
DAFSGRDSSITRLPLAFYYGMYAYAGWFYLNFVTEEVENPEKTIPLAICISMAIVTIGY
VLTNVAYFTTINAEELLLSNAVAVTFSERLLGNFSLAVPIFVALSCFGSMNGGVFAVSR
LFYVASREGHLPEILSMIHVRKHTPLPAVIVLHPLTMIMLFSGDLDSLLNFLSFARWLF
IGLAVAGLIYLRYKCPDMHRPFKVPLFIPALFSFTCLFMVALSLYSDPFSTGIGFVITL
TGVPAYYLFIIWDKKPRWFRIMSGFLALMPAQACDM"



cctgtgaaca ctatagcgct gagagagaca gtctgaaagc agaggaagac atcgatcagt 
aacaccaaga gacaccaaag ttgaaagttt tgttttcttt ccctctgttt t^ttttccc 
ccgtgtgtcc ctactatggt cagaaagcct gttgtgtcca ccatctccaa aggaggttac 
ctgcagggaa atgttaacgg gaggctgcct tccctgggca acaaggagcc «ctgggcag 
gagaaagtgc agctgaagag gaaagtcact ttactgaggg gagtctccat tatcattggc 
accatcattg gagcaggaat cttcatctct cctaagggcg tgctccagaa cacgggcagc 
gtgggcatgt ctctgaccat ctggacggtg tgtggggtcc tgtcactatt tggagctttg
tcttatgctg aattgggaac aactataaag aaatctggag gtcattacac atatattttg
gaagtctttg gtccattacc agcttttgta cgagtctggg tggaactcct cataatacgc 
cctgcagcta ctgctgtgat atccctggca tttggacgct acattctgga accatttttt
attcaa?gtg aaatccctga acttgcgatc aagctcatta cagctgtggg "taactgta 
gtgatggtcc taaatagcat gagtgtcagc tggagcgccc ggatccagat tttcttaacc 
?Sgcaagc tcacaglaat tctgataatt atagtccctg gagttatgca gctaattaaa 
ggtcaaacgc agaactttaa agacgccttt tcaggaagag attcaagtat tacgcggttg 
cf actggc?t tltattatgg aatgtatgca tatgctggct ggttttacct caactttg" 
actgaagaag tagaaaaccc tgaaaaaacc attccccttg caatatgtat atccatggcc 
attgtcacca ttggctatgt gctgacaaat gtggcctact ttacgaccat taatgctgag 
gagctgctgc tttcaaatgc agtggcagtg accttttctg agcggctact gggaaatttc 
tcattagcag ttccgatctt tgttgccctc tcctgctttg gctccatgaa cggtggtgtg 
Stgctg"? ccaggttatt c?atgttgcg tctcgagagg gtcaccttcc agaaatcctc 
tccatgattc atg?ccgcaa gcacactcct ctaccagctg ttattgtttt ^accctttg 
acaatgataa tgctcttctc tggagacctc gacagtcttt tgaatttcct cagttttgcc 
aggtggcttt ttattgggct ggcagttgct gggctgattt atcttcgata caaatgccca 
gSaScatc gtcctttcaa ggtgccactg ttcatcccag ctttgttttc cttcacatgc 
ctcttcatgg ttgccctttc cctctattcg gacccattta gtacagggat tggcttcgtc 
atcactctga ctggagtccc tgcgtattat ctctttatta tatgggacaa gaaacccagg 
tggtttagaa taatgtcagg gttcctagca ctgatgcctg cacaagcatg tgatatgtga 
aataaaatgg attcttctat agctaaatga gttccctctg gggagagttc tggtactgca 
atcacaatgc cagatggtgt ttatgggcta tttgtgtaag taagtggtaa gatgctatga 
agtaagtg?g tt?gtlltca tcttatggaa actcttgatg catgtgcttt tgtatggaat 
aaattttggt gcaatatgat gtcattcaac tttgcattga attgaatttt ggttgtattt 
atatgta?ta lacctgtcac gcttctagtt gcttcaacca ttttataacc atttttgtac 
atatStact tgaaaatatt ttaaatggaa atttaaataa acatttgata gtttacataa 



taaaaaaaaa aaaaaaaaaa 



60
120
180
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840
900
960
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2000 



DE Homo sapiens eukaryotic translation elongation factor 1 alpha 2, mRNA (cDNA
DE clone MGC:8362 IMAGE:2819899), complete cds.

/translation="MGKEKTHINIVVIGHVDSGKSTTTGHLIYKCGGIDKRTIEKFEKE
AAEMGKGSFKYAWVLDKLKAERERGITIDISLWKPETTKYYITIIDAPGHRDFIKNMIT
GTS Q AD C AVL I VAAGVGE FEAG I S KNGQTREHALLAYTLiGVKQL I VG VNKMD STE PAYS 
EKR YDE I VKEVS AYI KKIGYNPATVPFVP I SGWHGDNMLEP S PNMP WFKGWKVERKEGN 
ASGVSLLEALDTILPPTRPTDKPIiRIiPLQDVYKIGGIGTVPVGRVETGIIjRPGMVVTFA 
PVNITTEVKSVEMHHFJ^SEALPGDNVGFNVKNVSVKDIRRGNVCGDSKSDPPQEAAQF 
TSQVIILNHPGQISAGYSPVIDCHTAHIACKFAELKEKIDRRSGKKLEDNPKSLKSGDA 
AIVEMVPGKPMC^SFSQYPPLGRFAVRDMRQTVAVGVIKNVEKKSGGAGKVTKSAQKA 




caaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 



cccgccagcc 


cctcacactc 


60 


ggtcatcggc 


cacgtggact 


120 


cggaggtatt 


gacaaaagga 


180 


gggatccttc 


aagtatgcct 


240 


caccatcgac 


atctccctct 


300 


tgcccccggc 


caccgcgact 


360 


cgcagtgctg 


atcgtggcgg 


420 


gcagacgcgg 


gagcatgccc 


480 


cgtgaacaaa 


atggactcca 


540 


caaggaagtc 


agcgcctaca 


600 


gcccatctcc 


ggctggcacg 


660 


caagggctgg 


aaggtggagc 


720 


cctggacacc 


atcctgcccc 


780 


ggacgtgtac 


aagattggcg 


840 


cctgcggccg 


ggcatggtgg 


900 


agtggagatg 


caccacgagg 


960 


tgtgaagaac 


gtgtcggtga 


1020 


tgacccgccg 


caggaggctg 


1080 


gcagattagc 


gccggctact 


1140 


gtttgcggag 


ctgaaggaga 


1200 


caagtccctg 


aagtctggag 


1260 


tgtggagagc 


ttctcccagt 


1320 


gacggtggcc 


gtaggcgtca 


1380 


caccaagtcg 


gcgcagaagg 


1440 


gaccctcccc 


ggcggcgccg 


1500 


cgcgcgccgc 


tccggcgccc 


1560 


gaggccctcg gtcagcgact 


1620 


gaaaggcgcc 


cccgccccag 


1680 


accaataaac 


tgagcgaccc 


1740 


a 




1781 



Homo sapiens cDNA clone:HEMBA1000726, 3' end, expressed in whole embryo, 
mainly head. 

gagacggagt ctcgctcttg tcacccaggt tggagtgcag tggcacaatc tcggctcact 
acaacctcca cctcctgtgt ttaaacgatt ctcctgcttc agcctcctga gtagctggaa 
?tacaggccc tgccaccacc cccccgctaa tttttgtcta tttttttttt ttagtagaga 
cggggStca ccatgttggc tagtctggtc ttgaactcct gactgacctc a 9 ac ^ a ^J 
c?cgcctcag actcccaaag tgtcaggatt acaggcgtta gccaccatac c Jggcctgct 
cccagttttt acaagatgtt aattcccaat aatctgagag caatgtgtta atatgaatat 
taattcttct aaatgaatat tcatccttat ttcctacttg tataggtgga tgaataaaga 
tccaatagta taatagaaag actattagta agaatgccag aaggncagtc tcatgcacct 
ggtgaaataa accaaccaac caacctgaan tctaaagctt gngtggcaag taccactgtg 
gggaagtgta gaattaacnc tcttttccta agggtc 



60 
120 
180 
240 
300 
360 
420 
480 
540

Homo sapiens MDG1 mRNA, complete cds.
HTC. 



FT /translation="MATPQSIFIFAICILMITELILASKSYYDILGVPKSASERQIKKA
FT FHKLAMKYHPDKNKSPDAEAKFREIAEAYETLSDANRRKEYDTLGHSAFTSGKGQRGSG
FT SSFEQSFNFNFDDLFKDFGFFGQNQNTGSKKRFENHFQTRQDGGSSRQRHHFQEFSFGG
FT GLFDDMFEDMEKMFSFSGFDSTNQHTVQTENRFHGSSKHCRTVTQRRGNMVTTYTDCSG
FT Q"

tagctggctg agaggggact gggcgccggc ggggaaggag gagcgctagg tcggtgtacg 60 

accgagatta gggtgcgtgc cagctccggg aggccgcggt gaggggccgg gcccaagctg 120

ccgacccgag ccgatcgtca gggtcgccag cgcctcagct ctgtggagga gcagcagtag 180 

tcggagggtg caggatatta gaaatggcta ctccccagtc aattttcatc tttgcaatct 240 

gcattttaat gataacagaa ttaattctgg cctcaaaaag ctactatgat atcttaggtg 300

tgccaaaatc ggcatcagag cgccaaatca agaaggcctt tcacaagttg gccatgaagt 360 

accaccctga caaaaataag agcccggatg ctgaagcaaa attcagagag attgcagaag 420

catatgaaac actctcagat gctaatagac gaaaagagta tgatacactt ggacacagtg 480 

cttttactag tggtaaagga caaagaggta gtggaagttc ttttgagcag tcatttaact 540

tcaattttga tgacttattt aaagactttg gcttttttgg tcaaaaccaa aacactggat 600 

ccaagaagcg ttttgaaaat catttccaga cacgccagga tggtggttcc agtagacaaa 660 

ggcatcattt ccaagaattt tcttttggag gtggattatt tgatgacatg tttgaagata 720 

tggagaaaat gttttctttt agtggttttg actctaccaa tcagcataca gtacagactg 780 

aaaatagatt tcatggatct agcaagcact gcaggactgt cactcaacga agaggaaata 840 

tggttactac atacactgac tgttcaggac agtagttctt attctattct cactaaatcc 900 

aactggttga ctcttcctca ttatctttga tgctaaacaa ttttctgtga actattttga 960 

caagtgcatg atttcacttt aaacaatttg atatagctat taaatatatt taagggtttt 1020 

tttttttttg acaaattcaa cattcaacga gtagacaaaa tgctaattat ttccctgatt 1080 

aggaaagttt ctttaaaaaa cacgtaattt tgcctagtgc tttttctcta cctgcccttg 1140 

ggctcactaa tatcaccagt attattacca agaaaatatt gagtttacct gattaaactt 1200

taaaagttaa ttgtagattt aaattgtgtg aacctaatga tttttgcagt gaaaccttta 1260 

ctaattcaaa gttgcatgtt ctatgacatc tgtgacttgc gttgcagagt gtacatgaaa 1320 

ctgtataatt gagtcattca gtaaaggaga acagtatctt ggttaattgc tactgaaagg 1380

ttgagaaagg aatggtttga tatttaccac agcgctgtgc ctttctacag tagaactggg 1440 

gtaaaggaaa tggttttatt gcccatagtc atttaggctg gaaaaaagtt gaaaacttaa 1500 

cgaaatattg ccaagagatt gttatgtgtt tggttccagc ctaaaaatga ttttgtagtg 1560 

ttgaaatcat agctacttac atagcttttt catatttctt tcttagttgt tggcactctt 1620 

aggtcttagt atggatttat gtgtttgtgt gtgtgtagtt tatcctctct ctcatcttta 1680 

tctagagatt gactgatacc tcattctgtt tgtaaaacca gccagtaatt tctgtgcaac 1740 

cttactatgt gcaatatttt taaatcctga gaaatgtgtg cttttgtttt cggatagact 1800 

tatttcttta gttctgcact tttccacatt atactccata tgagtattaa tcctatggat 1860 

acatattaaa acaagtgtct catacaacat tgtatgtgag agaaatataa atatttacaa 1920 

cctgaaaaa 1929



Homo sapiens prostate stem cell antigen (PSCA) mRNA, complete cds.

/translation="MKAVLLALLMAGLALQPGTALLCYSCKAQVSNEDCLQVENCTQLG
EQCWTARIRAVGLLTVISKGCSLNCVDDSQDYYVGKKNITCCDTDLCNASGAHALQPAA
AILALLPALGLLLWGPGQL"

agggagaggc agtgaccatg aaggctgtgc tgcttgccct gttgatggca ggcttggccc 
tgcagccagg cactgccctg ctgtgctact cctgcaaagc ccaggtgagc aacgaggact 
gcctgcaggt ggagaactgc acccagctgg gggagcagtg ctggaccgcg cgcatccgcg 
cagttggcct cctgaccgtc atcagcaaag gctgcagctt gaactgcgtg gatgactcac 
aggactacta cgtgggcaag aagaacatca cgtgctgtga caccgacttg tgcaacgcca 
gcggggccca tgccctgcag ccggctgccg ccatccttgc gctgctccct gcactcggcc 
tgctgctctg gggacccggc cagctatagg ctctgggggg ccccgctgca gcccacactg 
ggtgtggtgc cccaggcctt tgtgccactc ctcacagaac ctggcccagt gggagcctgt 
cctggttcct gaggcacatc ctaacgcaag tttgaccatg tatgtttgca ccccttttcc 
ccnaaccctg accttcccat gggccttttc caggattccn accnggcaga tcagttttag 
tganacanat ccgcntgcag atggcccctc caaccntttn tgttgntgtt tccatggccc 660 
agcattttcc acccttaacc ctgtgttcag gcacttnttc ccccaggaag ccttccctgc 720

ccaccccatt tatgaattga gccaggtttg gtccgtggtg tcccccgcac ccagcagggg
acaggcaatc aggagggccc agtaaaggct gagatgaagt ggactgagta gaactggagg 
acaagagttg acgtgagttc ctgggagttt ccagagatgg ggcctggagg cctggaggaa 
ggggccaggc ctcacatttg tggggntccc gaatggcagc ctgagcacag cgtaggccct 
taataaacac ctgttggata agccaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



780 
840 
900 
960 
990 



DE Human arginine-rich protein (ARP) gene, complete cds. 

FT /translation="MGKWHVGGRRGSPRQWGATARGRDLEAVRRGGCGSVGRRRQRRRR
FT RRRRMRRMRRMWATQGLAVRVALSVLPGSRALRPGDCEVCISYLGRFYQDLKDRDVTFS
FT PATIENELIKFCREARGKENRLCYYIGATDDAATKIINEVSKPLAHHIPVEKICEKLKK
FT KDSQICELKYDKQIDLSTVDLKKLRVKELKKILDDWGETCKGCAEKSDYIRKINELMPK
FT YAPKAASAPTDL"

cttcggtcct gctgtagtgc cttctgcgcc aggcccggtt caatcagcgg ccacaactgt 60 

ctagggctca gacaccacca gccaatgagg gagggcacgt ggagccgcgt ctgggctcgc 120 

ggctcctgac caatggggaa gtggcatgtg ggagggcgcc ggggttcccc ccgccaatgg 180 

ggagctacgg cgcgcggccg ggacttggag gcggtgcggc gcggcgggtg cggttcagtc 240 

ggtcggcggc ggcagcggag gaggaggagg aggaggagga tgaggaggat gaggaggatg 300 

tgggccacgc aggggctggc ggtgcgcgtg gctctgagcg tgctgccggg cagccgggcg 360 

ctgcggccgg gcgactgcga agtttgtatt tcttatctgg gaagatttta ccaggacctc 420 

aaagacagag atgtcacatt ctcaccagcc actattgaaa acgaacttat aaagttctgc 480 

cgggaagcaa gaggcaaaga gaatcggttg tgctactata tcggggccac agatgatgca 540 

gccaccaaaa tcatcaatga ggtatcaaag cctctggccc accacatccc tgtggagaag 600 

atctgtgaga agcttaagaa gaaggacagc cagatatgtg agcttaagta tgacaagcag 660 

atcgacctga gcacagtgga cctgaagaag ctccgagtta aagagctgaa gaagattctg 720 

gatgactggg gggagacatg caaaggctgt gcagaaaagt ctgactacat ccggaagata 780 

aatgaactga tgcctaaata tgcccccaag gcagccagtg caccgaccga tttgtagtct 840 

gctcaatctc tgttgcacct gagggggaaa aaacagttca actgcttact cccaaaacag 900 

lilt llf 3 f t ^ attttt taagtgggct cctgacaata ctgtatcaga tgtgaagcct 960 

ggagctttcc tgatgatgct ggccctacag tacccccatg aggggattcc cttccttctg 1020 
ttgctggtgt actctaggac ttcaaagtgt gtctgggatt tttttattaa agaaaaaaaa 
tttctagctg tcaaaaaaaa aaa 



V 



1080 
1103 




DE Homo sapiens interleukin 11 receptor, alpha, transcript variant 1, mRNA
DE (cDNA clone MGC:2146 IMAGE:3502059), complete cds.



gggggctgta gctggtgaga ggaagtccta gaggctatgg acactctgct gctgggatca
ccgagatgag cagcagctgc tcagggctga gcagggtcct ggtggccgtg gctacagccc 
tggtgtctgc ctcctccccc tgcccccagg cctggggccc cccaggggtc cagtatgggc 
agccaggcag gtccgtgaag ctgtgttgtc ctggagtgac tgccggggac ccagtgtcct 
ggtttcggga tggggagcca aagctgctcc agggacctga ctctgggcta gggcatgaac 
tggtcctggc ccaggcagac agcactgatg agggcaccta catctgccag accctggatg 
gtgcacttgg gggcacagtg accctgcagc tgggctaccc tccagcccgc cctgttgtct 
cctgccaagc agccgactat gagaacttct cttgcacttg gagtcccagc cagatcagcg 
gtttacccac ccgctacctc acctcctaca ggaagaagac agtcctagga gctgatagcc 
agaggaggag tccatccaca gggccctggc catgcccaca ggatccccta ggggctgccc 
gctgtgttgt ccacggggct gagttctgga gccagtaccg gattaatgtg actgaggtga 
acccactggg tgccagcaca cgcctgctgg atgtgagctt gcagagcatc ttgcgccctg 
acccacccca gggcctgcgg gtagagtcag taccaggtta cccccgacgc ctgcgagcca 
gctggacata ccctgcctcc tggccgtgcc agccccactt cctgctcaag ttccgtttgc 
agtaccgtcc ggcgcagcat ccagcctggt ccacggtgga gccagctgga ctggaggagg 
tgatcacaga tgctgtggct gggctgcccc atgctgtacg agtcagtgcc cgggactttc 
tagatgctgg cacctggagc acctggagcc cggaggcctg gggaactccg agcactggga 
ccataccaaa ggagatacca gcatggggcc agctacacac gcagccagag gtggagcctc 
aggtggacag ccctgctcct ccaaggccct ccctccaacc acaccctcgg ctacttgatc 
acagggactc tgtggagcag gtagctgtgc tggcgtcttt gggaatcctt tctttcctgg 
gactggtggc tggggccctg gcactggggc tctggctgag gctgagacgg ggtgggaagg 
atggatcccc aaagcctggg ttcttggcct cagtgattcc agtggacagg cgtccaggag 
ctccaaacct gtagaggacc caggagggct tcggcagatt ccacctataa ttctgtcttg 
ctggtgtgga tagaaaccag gcaggacagt agatccctat ggttggatct cagctggaag 
ttctgtttgg agcccatttc tgtgagaccc tgtatttcaa atttgcagct gaaaggtgct 
tgtacctctg atttcacccc agagttggag ttctgctcaa ggaacgtgtg taatgtgtac 
atctgtgtcc atgtgtgacc atgtgtctgt gaggcaggga acatgtattc tctgcatgca 
tgtatgtagg tgcctgggga gtgtgtgtgg gtccttggct cttggccttt ccccttgcag 
gggttgtgca ggtgtgaata aagagaataa ggaagttctt ggaaaaaaaa aaaaaaaaaa 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaacctc ggg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1783 



FT /translation="MSSSCSGLSRVLVAVATALVSASSPCPQAWGPPGVQYGQPGRSVK
FT LCCPGVTAGDPVSWFRDGEPKLLQGPDSGLGHELVLAQADSTDEGTYICQTLDGALGGT
FT VTLQLGYPPARPVVSCQAADYENFSCTWSPSQISGLPTRYLTSYRKKTVLGADSQRRSP
FT STGPWPCPQDPLGAARCVVHGAEFWSQYRINVTEVNPLGASTRLLDVSLQSILRPDPPQ
FT GLRVESVPGYPRRLRASWTYPASWPCQPHFLLKFRLQYRPAQHPAWSTVEPAGLEEVIT
FT DAVAGLPHAVRVSARDFLDAGTWSTWSPEAWGTPSTGTIPKEIPAWGQLHTQPEVEPQV
FT DSPAPPRPSLQPHPRLLDHRDSVEQVAVLASLGILSFLGLVAGALALGLWLRLRRGGKD
FT GSPKPGFLASVIPVDRRPGAPNL"



7 



Homo sapiens mRNA; cDNA DKFZp564O2071 (from clone DKFZp564O2071); complete



/ translation^ n MPSLWDRF S S SSTS 3 S PS SLPRTPTPDRPPRS AWGS ATREEGFDR 

STSLESSDCESLDSSNSGFGPEEDTAYLDGVSLPDFEIiSDPE^ 

AQARLGSRRPARLLMPSQLVSQVGKEIJjRIAYSEPCGLR 

AIjDPSIjVPTFQLTLVLRLDSRIjWPKIQGLFSSANSPFLPGFSQSLTLSTGFRVIKICKL 
SSEQLPIEEC" 



ttaaaacaaa aaaaaaaaaa aaaaaaaa 



ggacgggtct gggcggctct 60
gacgcacttg tcttagcagt 120
ctctgagttc atcagcaaac 180
gcttctcgtc gtcgtccacc 240
atcggccgcc gcgctcagcc 300
cgagcctgga gagctcggac 360
aggaagacac ggcttacctg 420
ctgaggatga acacttgtgt 480
cgcggctggg ctctcgacgc 540
tgggcaaaga actactgcgc 600
tggacgtctg cgtggagcag 660
ccagcctggt gcccaccttc 720
ccaagatcca ggggctgttt 780
ccctgacgct gagcactggc 840
tgcccattga ggagtgttga 900
gacgactgaa cttttggggt 960
tggaaaactg aggcagccac 1020
attgagttgt gtgcgggtgg 1080
aaacaaaggc ttaggggcca 1140
ttatttttgt tactgacagt 1200
cgccccagcc tggcccaggg 1260
agggggaggt cggaggtcgt 1320
tgtgtttgtt gtttgttttg 1380
gtaggcagct atcttacaga 1440
cacttgggat ctttgacact 1500
atcgtgtact gcagagttga 1560
cttcccggga ggagtgccat 1620
actcactgtt catgaataca 1680
ttttctaata aacatgtttg 1740



DE Homo sapiens collagen alpha 3 type IX (COL9A3) mRNA, complete cds. 
alpha-3 type IX collagen; COL9A3 gene; collagen. 

/translation="MAGPRACAPLLLLLLLGQLLAAAGAQRVGLPGPPGPPGRPGKPGQ
DGIDGEAGPPGLPGPPGPKGAPGKPGKPGEAGLPGLPGVDGLTGRDGPPGPKGAPGERG
SLGPPGPPGLGGKGLPGPPGEAGVSGPPGGIGLRGPPGPPGLPGLPGPPGPPGPPGHPG
VLPEGATDLQCPSICPPGPPGPPGMPGFKGPTGYKGEQGEVGKDGEKGDPGPPGPAGLP
GSVGLQGPRGLRGLPGPLGPPGDRGPIGFRGPPGIPGAPGKAGDRGERGPEGFRGPKGD
LGRPGPKGTPGVAGPSGEPGMPGKDGQNGVPGLDGQKGEAGRNGAPGEKGPNGLPGLPG
RAGSKGEKGERGRAGELGEAGPSGEPGVPGDAGMPGERGEAGHRGSAGALGPQGPPGAP
GVRGFQGQKGSMGDPGLPGPQGLRGDVGDRGPGGAEGPKGDQGIAGSDGLPGDKGELGP
SGLVGPKGESGSRGELGPKGTQGPNGTSGVQGVPGPPGPLGLQGVPGVPGITGKPGVPG
KEASEQRIRELCGGMISEQIAQLAAHLRKPLAPGSIGRPGPAGPPGPPGPPGSIGHPGA
RGPPGYRGPTGELGDPGPRGNQGDRGDKGAAGAGLDGPEGDQGPQGPQGVPGTSKDGQD
GAPGEPGPPGDPGLPGAIGAQGTPGICDTSACQGAVLGGVGEKSGSRSS"



atggccgggc cgcgcgcgtg cgcgccgctc ctgctcctgc tcctcctcgg gcagcttctg 60
gcggccgccg gggcgcagag agtgggactc cccggccccc ccggcccccc agggcgccct 120
gggaagcccg gccaggacgg cattgacgga gaagctggtc ctccaggtct gcctggtccc 180
ccgggaccaa agggggcccc aggaaagccg gggaaaccag gagaggctgg gctgccggga 240
ctgccgggtg tggatggtct gactggacga gatggacccc ctggacccaa gggtgcccct 300
ggggaacggg gaagtctggg acccccgggg ccgcccgggc tggggggcaa aggcctccct 360
ggaccccccg gagaggcagg agtgagcggc cccccaggtg ggatcggcct ccgcggcccc 420
ccgggacctc ctggactccc cggcctccct ggtcccccag gacctcccgg accccctgga 480
cacccaggag tcctccctga aggcgctact gaccttcagt gcccaagtat ctgcccgcca 540
ggtcccccag ggccccctgg aatgccaggg ttcaagggac ccactggcta caaaggcgag 600
cagggggaag tcggcaagga cggcgagaag ggtgaccctg gcccccctgg gcccgccggc 660
ctcccgggca gcgtggggct gcagggcccc cggggattac gaggactgcc agggccactc 720
gggccccctg gggaccgggg tcccattggg ttccgagggc cgcctgggat cccaggagcg 780
cctgggaaag cgggtgaccg aggcgagagg ggcccagaag ggttccgcgg ccccaagggt 840
gacctcggca gacctggtcc caagggaacc cccggagtgg ccgggccaag cggagagccg 900
ggcatgccag gcaaggacgg ccagaatggc gtgccaggac tcgatggcca gaagggagag 960
gctggtcgca acggtgctcc gggagagaag ggccccaacg ggctgccggg cctccctgga 1020
cgagcggggt ccaaaggcga gaagggagaa cggggcagag ctggggagct gggtgaggcc 1080
ggcccctctg gagagccagg cgtccctgga gatgctggca tgcctgggga gcgcggtgag 1140
gctggccacc ggggctcagc gggggccctc ggcccacaag gccctcccgg agcccctggt 1200
gtccgaggct tccagggcca gaagggcagc atgggagacc ccggccttcc aggcccccag 1260
ggcctccgag gtgacgtggg cgaccggggt ccgggaggtg ccgaaggccc taagggagac 1320
cagggtattg caggttccga cggtcttcct ggggataaag gagaactggg tcccagcggc 1380
ctggtcggac ccaaaggaga gtctggcagt cgaggggagc tgggccccaa aggcacccag 1440
ggtcccaacg gcaccagcgg tgttcagggt gtccccgggc cccccggtcc tctgggcctg 1500
cagggcgtcc cgggtgttcc tggcatcacg gggaagccgg gagttccggg gaaggaggcc 1560
agcgagcagc gcatcaggga gctgtgtggg gggatgatca gcgaacaaat tgcacagtta 1620
gccgcgcacc taaggaagcc tttggcaccc gggtccattg gtcggcccgg tccagctggc 1680
ccccctgggc ccccaggacc cccaggctcc attggtcacc ctggcgctcg aggacccccc 1740
ggataccgcg gtcccactgg ggagctggga gaccccgggc ccagaggaaa ccagggtgac 1800
agaggagaca aaggcgcggc aggagcaggg ctggacgggc ctgaaggaga ccaggggccc 1860
caaggacccc aaggcgtgcc cggcaccagc aaggacggcc aggacggtgc tcccggcgag 1920
cctgggcctc ccggagatcc tgggcttcca ggtgccattg gggcccaggg gacaccgggg 1980
atctgcgaca cctcagcctg ccaaggagcc gtgttaggag gggtcgggga gaaatcaggc 2040
tctcgaagct cataaaattc aacgtgagga agcaagtgac aaggacgccc gaagcacagt 2100
ggacggtcat gaaggagcgg gggtgtggca ggcgggtgac gtccaggaga gggagcgccc 2160
ctggctgccc ctcggccgcc gactggacgc gtgggccttg ccagcgagca ccctcattgg 2220
gctgtcgcct gacagcatac ctcaaaaggc cctagctaat aaacctgtaa gcccagcatt 2280
tgagagaagg tagggtgtgt atatataaaa ggttgtgtac aactccacga ggtgaaaaat 2340
attcagtaac ttgtttgcat agcatttgtg taaagactat gatctcatcc caataaaatg 2400



atatattaaa tcttcagatt aatgactggc tacagagtaa caaaaaataa acaatttaat 2460 
gtacagtaaa ttctctccca ttaac 2485

//



Homo sapiens cDNA FLJ20113 fis, clone COL05437.
fis (full insert sequence); oligo capping.



/translation=»MAAKEPQQQKQEPLGSDSEGVNC^TOEAIMAQQDRIQQBIAVQN 
PiVSERLELSVLYKEYAEnDDNIYQQKIKDI^KKYSYIRKTRPDGNCFYRAFGFSHLEAL 
I^DSKELQRFKAVSAKSKEDLVSQGFTEFTIEDFHOTFI^LIEQVERQTSVADLLASFN 
DQSTSDYLWYLRLLTSGYIiQRESKFFEHFIEGGRTVKEFCQQEVEPMCKESDHIHIIA 

IaAQAIiS VS I QVEYMDRGEGGTTNPHI FPEGS EPKVYLLYRPGHYD ILYK" 



aattggcaac ccggaagcgg tcggtagtgc ggcgctgttt aaagatggcg gcggaggaac 
ctcafcaoca gaagcaggag ccgctgggca gcgactccga aggtgttaac tgtctggcct 
atoaSaagc catcatggc? cagcaggacc gaattcagca agagattgct gtgcagaacc 
JtctggtSc agagcglctg gagctctcgg tcctatacaa ggagtatgct gaagatgaca 
acatcKLa acagaagatc aaggacctcc acaaaaagta ctcgtacatc cgcaagacca 
ggcctgacgg caactg?ttc tatcgggctt tcggattctc ccacttggag 9<=actgctgg 
a?gacagcaa ggagttgcag cggttcaagg ctgtgtctgc caagagcaag gaagacctgg 
tglcccaggg c?tcac?gaa tlcacaattg aggatttcca caacacgttc atggacctga 
tSagcagS ggagaggcag acctctgtcg ccgacctgct ggcctccttc aatgaccaga 
gcacctccga c?acc??gtg gtctacctgc ggctgctcac ctcgggctac ctgcagcgcg 
agagcaagtt cttcgagcac ttcatcgagg gtggacggac tgtcaaggag ttctgccagc 
aggaggtgga gcccatgtgc aaggagagcg accacatcca catcattgcg ctggcccagg 
ccctcagcgt gtccatccag gtggagtaca tggaccgcgg cgagggcggc accaccaatc 
cgcaca?c?t ccctgagggc tccgagccca aggtctacct tctctaccgg "tggacact 
acgatatcct ctacaaatag ggctggctcc agcccgctgc tgccctgctg ^ccccctctg 
cclggcgcta gacatgtaca gaggtttttc tgtggttgta aatggtccta tttcaccccc 
ttcttcctgt cacatgaccc ccccccatgt tttattaaag ggggtgctgg tggtgagccg 
tgtgtgcg?g tccctgctct gctgcccgcc tggctgctct gtctgctgcc ccctcccccc 
2g?ggg?cc ccctgctttt cacctatcta ctcctgagct tccccaacag gagcaggttt 
SSggccag gcctcttgga ggcccctcct gcttcgttgg gttctgcttc cttcccttct 
lllll^cll aggggct?ct Lgggatcct ggaagttcct tagggacttg ^ccagggtcc 
cagggccacc cacacttcat ctgctccctc ataggcccca cctccacgtc ccggctgggc 
ccclgacccc agcttcctgc cctccaccgg gagtctgcat ggttgggagt cctgggtgga 
ggggcctttg tgaggctgga cccggctcag ggcaggtgga ggagctgggc ctcccacagg 
Sgcccgggc agtglcatcc tggtggggga gggcagcctt caaacgtgtg 999tctacag 
?cctcagqtc taggcagggc agccggttct ccacctcccc atccgcccca ggccccctgc 
ctgtgccSc cKgcaScc cLtgcttgg gccacggtgt ctctgcattg cctgcctttt 
tgc2tcacc tcttttcttc cccgccccct gcacattcgg ggtctcagcc cccaggctgt 
glgctccttg ggggcaggcc ctcaataaat gtgaactgct gctgccgcca aaaaaaaaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1747 



aaaaaaa 



m seance" 1 NIH - M ° C - 20 Homo sapiens cDNA clone IMAGE:4026010 5', mRNA



aattgatatt ttttgctgct tcctcggccc aggagaaagc atgtcaggac agagctgttg 

gattggcttt gatagaggaa tggggatgat gtaagtttac agtattcctg ggqtttaatt 

gttgtgcagt ttcatagatg ggtcaggagg tggacaagtg gggccagaga tgatggcagt 

ccagcagcaa ctccctgtgc tcccttctct ttgggcagag attctatttt tgacSttgc 

acaagacagg tagggaaagg ggacttgtgg tagtggacca tacctgggga ccaaaagaga 

cccactgtaa ttgatgcatt gtggcccctg atcttccctg tctcaclclt cttttcSS 

atcccggttg caatctcact cagacatcac agtaccaccc caggggtggc agtagacaac 

aacccagaaa tttagacagg gatctcttac ctttggaaaa taggggt?fg gcatSaggt 

ggttgtgatt aagaagatgg tttgttatta aatagcatta aactggaat? J^ 9 ** 99 * 



60 
120 
180 
240 
300 
360 
420 
480 
532 



Human plasma serine protease (protein C) inhibitor mRNA, complete cds. 

/translation="MQLFLLLCLVLLSPQGASLHRHHPREMKKRVEDLHVGATVAPSSR
RDFTFDLYRALASAAPSQNIFFSPVSISMSLAMLSLGAGSSTKMQILEGLGLNLQKSSE
KELHRGFQQLLQELNQPRDGFQLSLGNALFTDLVVDLQDTFVSAMKTLYLADTFPTNFR
DSAGAMKQINDYVAKQTKGKIVDLLKNLDSNAVVIMVNYIFFKAKWETSFNHKGTQEQD
FYVTSETVVRVPMMSREDQYHYLLDRNLSCRVVGVPYQGNATALFILPSEGKMQQVENG
LSEKTLRKWLKMFKKRQLELYLPKFSIEGSYQLEKVLPSLGISNVFTSHADLSRISNHS
NIQVSEMVHKAVVEVDESGTRAAAATGTIFTFRSARLNSQRLVFNRPFLMFIVDNNILF
LGKVNRP"

aattccggca gagctccgtt tcctcataga acaaagaaca tccaccatgc agctcttcct 
cctcttgtgc ctggtgcttc tcagccctca gggggcctcc cttcaccgcc accacccccg 
ggagatgaag aagagagtcg aggacctcca tgtaggtgcc acggtggccc ccagcagcag 
aagggacttt acctttgacc tctacagggc cttggcttcc gctgccccca gccagaacat 
cttcttctcc cctgtgagca tctccatgag cctggccatg ctctccctgg gggctgggtc 300 
cagcacaaag atgcagatcc tggagggcct gggcctcaac ctccagaaaa gctcagagaa 360 
ggagctgcac agaggctttc agcagctcct tcaggaactc aaccagccca gagatggctt 420 
ccagctgagc ctcggcaatg cccttttcac cgacctggtg gtagacctgc aggacacctt 480 
cgtaagtgcc atgaagacgc tgtacctggc agacactttc cccaccaact ttagggactc 540 
tgcaggggcc atgaagcaga tcaatgatta tgtggcaaag caaacgaagg gcaagattgt 600 
ggacttgctt aagaacctcg atagcaatgc ggtcgtgatc atggtgaatt acatcttctt 660 
taaagctaag tgggagacaa gcttcaacca caaaggcacc caagagcaag acttctacgt 720 
gacctcggag actgtggtgc gggtacccat gatgagccgc gaggatcagt atcactacct 780 
cctggaccgg aacctctcct gcagggtggt gggggtcccc taccaaggca atgccacggc 840 
tttgttcatt ctccccagtg agggaaagat gcagcaggtg gagaatggac tgagtgagaa 900 
aacgctgagg aagtggctta agatgttcaa aaagaggcag ctcgagcttt accttcccaa 960 
attctccatt gagggctcct atcagctgga gaaagtcctc cccagtctgg ggatcagtaa 1020

cgtcttcacc tcccatgctg atctgtcccg catcagcaac cactcaaata tccaggtgtc 
tgagatggtg cacaaagctg tggtggaggt ggacgagtcg ggaaccagag cagcggcagc 
cacggggaca atcttcactt tcaggtcggc ccgcctgaac tctcagaggc tagtgttcaa 
caggcccttt ctgatgttca ttgtggataa caacatcctc ttccttggca aagtgaaccg 
cccctgaggt ggggcttctc ctgaaatcta caggcctcag ggtgggagat gaagggggct 
atgctatggc ccatctgtat gctggtagct agtgatttac gacaggttta gttgactaga 
tgaggcatta caaataatat tactctatgg atgattgctt ccacccacac gactgcaaca 
tacaggtgcc ttggggaaat gtggagaaca ttcaatctgc cgtcactatt catcaatgaa 
gattagcact gagatccaga gaggctggat ggacttgctc aagttcacca gcatggtagt 1560 
ggcaaagaga ggtccagagt cctggccctt gatgccgagc tcatgccaca aagctcagta 1620 
ggagggatgt tccagtggat gagggccagc caggaagcac aggtccaagg ctggtcccac 1680 
acttatcagc agcaacaact gtcagttcat cctgcatggg aaaaatgttg gaatgggagt 1740 
ctgaaatggg gctactgttt cagtcctaac gtgctgtgtg acattgggac aacactttcc 1800 
ctctctggac ctcagtttcc ctctgtatac aaggatcaga ttcttgctgt gacccaagaa 1860
ctcctgaaat catatagaaa ggctggggtg ggccctgtca ttcgtggttg atttcaatac 1920 
actcaagtgc cattcatcct ttaagaaaaa catctggata tcaaggtgga aatggcccat 1980 
ttaatgattg attatatcat tttgtggata tagttataat ctgatgggcc tggctgggag 2040 
tggaagaagg gaagcctttt gcaaatagta gagtgtcagt tgcaggtgcc aatgactaac 2100 
tttttg 2106
DE Homo sapiens DKFZP586A0522 protein, mRNA (cDNA clone MGC:5320
DE IMAGE:2900478), complete cds.

FT                   /translation="MELTIFILRLAIYILTFPLYLLNFLGLWSWICKKWFPYFLVRFTV
FT                   IYNEQMASKKRELFSNLQEFAGPSGKLSLLEVGCGTGANFKFYPPGCRVTCIDPNPNFE
FT                   KFLIKSIAENRHLQFERFVVAAGENMHQVADGSVDVVVCTLVLCSVKNQERILREVCRV
FT                   LRPGGAFYFMEHVAAECSTWNYFWQQVLDPAWHLLFDGCNLTRESWKALERASFSKLKL
FT                   QHIQAPLSWELVRPHIYGYAVK"

tgagcaatgg agcttaccat ctttatcctg agactggcca tttacatcct gacatttccc 60 

ttgtacctgc tgaactttct gggcttgtgg agctggatat gcaaaaaatg gttcccctac 120 

ttcttggtga ggttcactgt gatatacaac gaacagatgg caagcaagaa gcgggagctc 180 

ttcagtaacc tgcaggagtt tgcgggcccc tccgggaaac tctccctgct ggaagtgggc 240 

tgtggcacgg gggccaactt caagttctac ccacctgggt gcagggtgac ctgtattgac 300 

cccaacccca actttgagaa gtttttgatc aagagcattg cagagaaccg acacctgcag 360 

tttgagcgct ttgtggtagc tgccggggag aacatgcacc aggtggctga tggctctgtg 420 

gatgtggtgg tctgcaccct ggtgctgtgc tctgtgaaga accaggagcg gattctccgc 480 

gaggtgtgca gagtgctgag accgggaggg gctttctatt tcatggagca tgtggcagct 540 

gagtgttcga cttggaatta cttctggcaa caagtcctgg atcctgcctg gcaccttctg 600 

tttgatgggt gcaacctgac cagagagagc tggaaggccc tggagcgggc cagcttctct 660 

aagctgaagc tgcagcacat ccaggcccca ctgtcctggg agttggtgcg ccctcatatc 720

tatggatatg ctgtgaaata gtgtgagctg gcagttaaga gctgaatggc tcaaagaatt 780 

taaagcttca gttttacatt taaaatgcta agtgggagaa gagaaacctt ttttttgggg 840 

ggcggttttt ttggtttgtt gttggttttt tttttttttt ggcgggaaga aagagttttg 900 

ctcttgttgc ccaggctgga gtgcagtgac gtggtctccg ctcactgcaa cctccacctc 960 

gcgggtttag gcgattcttc tgcctcagcc tccctagtag ctgggattgc aggtgcccac 1020 

caccatgccc agctaatttg tatttttagt tgagacaggg tttcactacg ttggccaggc 1080 

tggtcttgaa ctcctgatct caggcaatcc acccacctca gcctcccaaa gtgctgggat 1140 

gacaggcgtg agcaaccgca cccagcttaa ggtttttttg ttttgttttg agacggagtt 1200

ttcgctcttg ttgcccaggc tggagtgcaa tgctgtgatc tcggcttacc acaacctcca 1260 

cctcccgggt tcaagtgatt cacctgcctc agcctcctga gtagctggta ttacaggcat 1320

gcgtcaccac gccggctaat tttgtacttt tagtagagat ggtgtttccc cacgttggtc 1380 

agtctggtct caaattcctg acctcaggtg atctgcctgc ctcggcctcc caaagtgctg 1440 

tgattgcaga cgtcagccac catgcctggc ctgaaacctt ttttaggtaa agttgaattc 1500 

catccttaaa agtttctgtt atatcctatt tagccatttt ctattgtctc ccaaagaatt 1560 

cacatcaaaa aaacagcttt gaactccccc ttcaaaggaa acagtcgact ttcataatta 1620 

gcatctacca ttatccccaa atcttatttt attcattgac ttgaaatttt ttccaattgc 1680 

tttttttttt ttttttttta aggttaagag cagaggttta ctaggccaaa gaaagagaat 1740 

agctctctgt tgcagagagg ggtcctggag aaatgggtta ccccagttgt cttatttaaa 1800 

tggttaccca tcagatttta attttatctt ctctttgaga gcttggtaat aagaagcact 1860

taaatcactc caaagaagac tttaaaaagg gagcagtgaa aaggtcttaa taatttattg 1920

attgaattaa gaaatactag ctaattaaga atctgagtct aaacagcaca gattttttct 1980 

ttctgctttt aaattgtgtt ttaaaaaaag agacaggggg ctgggcgtgg tggctcgcgc 2040 

ctgtgatcct agcactttgg gaggccgagg cgggtggatc acgaggtagg agttaaagac 2100 

cagcctggcc aacatggcaa aaccctacta aagatacaaa aaaaaaaaaa aa 2152



Homo sapiens calcium binding protein 1 (calbrain), mRNA (cDNA clone

/translation="MCQCVRVCVCVCACATQRASHSALPGTTISVKDWRLCLLDQFDAC
ARSGLSEPRSLTLRVPSCGKPLPGPGARLGREVTPCLSFAFAWCWLKMCQEEQTSYMVV
QTSEEGLAADAELPGPLLMLAQNCAVMHNLLGPACIFLRKGFAENRQPDRSLRPEEIEE
LREAFREFDKDKDGYINCRDLGNCMRTMGYMPTEMELIELSQQINMNLGGHVDFDDFVE
LMGPKLLAETADMIGVKELRDAFREFDTNGDGEISTSELREAMRKLLGHQVGHRDIEEI
IRDVDLNGDGRVDFEEFVRMMSR"

ggtgggtgcc tgtagaccaa gctgctcagg aggctgaggc aggagaatca cttgaatccg
ggagtcagag gttgcagtga gccaagatca cgccactgca ctccagcctg ggcgacagag 
tgcactagcc acacacaaaa aaggaggggg catgtttcca ctttgcccca gtccccacct 
ctcacaccct gcctgcttcg tttatgtcag caatggcacc tgcatgtgca acctctggtc 
tggcagacca gccaagcctg gtctccatgc tgccagctcc gattgccctc actgccttcc 
agcggggatc tggctggggt aggacaagtc tgtttgggag ctgggaggaa tgacttagcc 
ctggagattg actgaattgg gaggttggga gcattttggg aaaggatgtt gtctcattag 
gctctgagtg aacacttccc attggtgaga atagaagccc cccgctgccc ctgtctcttc 
atgcacttgg gtgtgtgatt ctgtgcatat gtatgtgcca gtgcgtgcgt gtgtgtgtgt 
* t t cgc at gtgccaca cagagggcat cacattcagc tctccccgga accaccattt 
ccgtcaagga ttggagactc tgtctgttgg accaatttga tgcctgtgca cgctcagggc 
tctctgaacc ccgctccttg actctcaggg tcccctcttg cggaaagccc ctcccgggac 
caggagccag gctgggcagg gaggtgacac cctgcctctc ttttgcattt gcttggtgct 
ggctgaagat gtgccaggag gaacagacca gctacatggt ggtgcagacg agcgaggagg 
ggctggcggc tgacgccgag ctcccgggac cgctcctgat gctggcccag aactgcgcag 
tcatgcacaa cctgctgggc cctgcctgca ttttcctgcg caagggcttc gctgagaaca 
ggcagcctga tagatcactg cgaccagagg aaattgaaga gctccgagag gccttcagag 
aattcgacaa ggacaaggat ggctacatca actgccggga tctgggcaac tgcatgcgca 
ccatgggcta catgcccacc gagatggagc tcatcgaact gtcccagcag atcaacatga 
acctgggtgg ccatgtagat tttgatgact tcgtggagct aatggggcct aaactcctgg 
cagagacagc agatatgatt ggtgtaaagg aactgcgaga tgctttccga gagtttgaca 
ccaatggtga tggggaaata agcaccagtg agctgcgaga ggctatgagg aa 9Ctcctgg 
gtcatcaggt gggacaccga gacatagagg aaattatccg agatgtggac ctcaatgggg 
atggacgagt ggactttgaa gagtttgtcc ggatgatgtc ccgctgaggc cgcgagggcc 
cctccaggac tgccaagctc ccaaaggcgg ggctaagagg agctagagct tgcctcaccc 
gctgtagccg ccgagagccc aggatgtact ggcggatggg gcctgcctgc accccgggga 
ggcgcccacc ccggaccccc acccctccgc actgtgaaag actaactcct gcaactggaa 
agcgggggcg cccgccgacg aggaggccac cgtgccaagc cggcagaggt catgccaggc 
gccaagggcc atgtgcccag ctgctgctgg ctgggtgggc cagggagccc gccagcagac 
cccacacagc atgtccgccc cagggcaaag cctcccacct tcgctctgcg cccgtcccag 
ctcgcctcag ccctgttatc tcagaaccaa taaaaatatt tccaagagca aaaaaaaaaa 
aaaaaaaa 
Homo sapiens TNNT1 gene, exons 1-11 (and joined CDS)



FT   source          1..16689
FT                   /chromosome="19"
FT                   /db_xref="taxon:9606"
FT                   /mol_type="genomic DNA"
FT                   /organism="Homo sapiens"
FT                   /clone="DMPC-HFF#1-1495-E10"
FT                   /map="q13.4"
FT   promoter        1..2580
FT                   /note="promoter region"
FT                   /gene="TNNT1"
FT   TATA_signal     2552..2557
FT                   /note="putative TATA box"
FT                   /gene="TNNT1"
FT   5'UTR           join(2581..2626,4587..4597)
FT                   /gene="TNNT1"
FT   prim_transcript join(2581..16689,AJ011713.1:1..1846)
FT                   /gene="TNNT1"
FT   mRNA            join(2581..2626,4587..4629,4734..4747,5048..5074,
FT                   6189..6210,9829..9892,10447..10563,10789..10866,
FT                   13679..13792,14577..14686,AJ011713.1:519..609,
FT                   AJ011713.1:748..788,AJ011713.1:1715..1846)
FT                   /note="alternatively spliced mRNA"
FT                   /gene="TNNT1"
FT                   /product="troponin T, slow skeletal muscle"
FT   mRNA            join(2581..2626,4587..4629,4734..4747,5048..5074,
FT                   5289..5321,6189..6210,9829..9892,10447..10563,10789..10866,
FT                   13679..13792,14577..14686,AJ011713.1:519..609,
FT                   AJ011713.1:748..788,AJ011713.1:1715..1846)
FT                   /note="alternatively spliced mRNA"
FT                   /gene="TNNT1"
FT                   /product="troponin T, skeletal and cardiac muscle"
FT   mRNA            join(2581..2626,4587..4629,4734..4747,5048..5074,
FT                   5289..5321,6189..6210,9829..9892,10447..10563,10789..10866,
FT                   13679..13792,14577..14686,AJ011713.1:471..609,
FT                   AJ011713.1:748..788,AJ011713.1:1715..1846)
FT                   /note="alternatively spliced mRNA"
FT                   /gene="TNNT1"
FT                   /product="troponin T, skeletal muscle"
FT   exon            2581..2626
FT                   /number=1
FT                   /gene="TNNT1"
FT   exon            4587..4629
FT                   /number=2
FT                   /gene="TNNT1"
FT   CDS             join(4598..4629,4734..4747,5048..5074,6189..6210,
FT                   9829..9892,10447..10563,10789..10866,13679..13792,
FT                   14577..14686,AJ011713.1:519..609,AJ011713.1:748..788,
FT                   AJ011713.1:1715..1760)
FT                   /db_xref="SPTREMBL:O95472"
FT                   /note="alternatively spliced isoform"
FT                   /gene="TNNT1"
FT                   /product="slow skeletal muscle troponin T"
FT                   /protein_id="CAA09750.1"
FT                   /translation="MSDTEEQEYEEEQPEEEAAEEEEEEEERPKPSRPVVPPLIPPKIP
FT                   EGERVDFDDIHRKRMEKDLLELQTLIDVHFEQRKKEEEELVALKERIERRRSERAEQQR
FT                   FRTEKERERQAKLAEEKMRKEEEEAKKRAEDDAKKKKVLSNMGAHFGGYLVKAEQKRGK
FT                   RQTGREMKVRILSERKKPLDIDYMGEEQLREKAQELSDWIHQLESEKFDLMAKLKQQKY
FT                   EINVLYNRISHAQKFRKGAGKGRVGGRWK"
FT   CDS             join(4598..4629,4734..4747,5048..5074,5289..5321,
FT                   6189..6210,9829..9892,10447..10563,10789..10866,
FT                   13679..13792,14577..14686,AJ011713.1:519..609,
FT                   AJ011713.1:748..788,AJ011713.1:1715..1760)
FT                   /db_xref="GOA:P13805"
FT                   /db_xref="SWISS-PROT:P13805"
FT                   /note="alternatively spliced isoform"
FT                   /gene="TNNT1"
FT                   /product="slow skeletal muscle troponin T"
FT                   /protein_id="CAA09751.1"
FT                   /translation="MSDTEEQEYEEEQPEEEAAEEEEEAPEEPEPVAEPEEERPKPSRP
FT                   VVPPLIPPKIPEGERVDFDDIHRKRMEKDLLELQTLIDVHFEQRKKEEEELVALKERIE
FT                   RRRSERAEQQRFRTEKERERQAKLAEEKMRKEEEEAKKRAEDDAKKKKVLSNMGAHFGG
FT                   YLVKAEQKRGKRQTGREMKVRILSERKKPLDIDYMGEEQLREKAQELSDWIHQLESEKF
FT                   DLMAKLKQQKYEINVLYNRISHAQKFRKGAGKGRVGGRWK"
FT   CDS             join(4598..4629,4734..4747,5048..5074,5289..5321,
FT                   6189..6210,9829..9892,10447..10563,10789..10866,
FT                   13679..13792,14577..14686,AJ011713.1:471..609,
FT                   AJ011713.1:748..788,AJ011713.1:1715..1760)
FT                   /db_xref="GOA:P13805"
FT                   /db_xref="SWISS-PROT:P13805"
FT                   /note="alternatively spliced isoform"
FT                   /gene="TNNT1"
FT                   /product="slow skeletal muscle troponin T"
FT                   /protein_id="CAA09752.1"
FT                   /translation="MSDTEEQEYEEEQPEEEAAEEEEEAPEEPEPVAEPEEERPKPSRP
FT                   VVPPLIPPKIPEGERVDFDDIHRKRMEKDLLELQTLIDVHFEQRKKEEEELVALKERIE
FT                   RRRSERAEQQRFRTEKERERQAKLAEEKMRKEEEEAKKRAEDDAKKKKVLSNMGAHFGG
FT                   YLVKAEQKRGKRQTGREMKVRILSERKKPLDIDYMGEEQLRARSAWLPPSQPSCPAREK
FT                   AQELSDWIHQLESEKFDLMAKLKQQKYEINVLYNRISHAQKFRKGAGKGRVGGRWK"
FT   exon            4734..4747
FT                   /number=3
FT                   /gene="TNNT1"
FT   exon            5048..5074
FT                   /number=4
FT                   /gene="TNNT1"
FT   exon            5289..5321
FT                   /note="alternative exon"
FT                   /number=5
FT                   /gene="TNNT1"
FT   exon            6189..6210
FT                   /number=6
FT                   /gene="TNNT1"
FT   exon            9829..9892
FT                   /number=7
FT                   /gene="TNNT1"
FT   exon            10447..10563
FT                   /number=8
FT                   /gene="TNNT1"
FT   exon            10789..10866
FT                   /number=9
FT                   /gene="TNNT1"
FT   exon            13679..13792
FT                   /number=10
FT                   /gene="TNNT1"
FT   exon            14577..14686
FT                   /number=11
FT                   /gene="TNNT1"
FT   intron          14687..>16689
FT                   /note="approx. 200 bp gap with AJ011713"
FT                   /number=11
FT                   /gene="TNNT1"
XX
SQ   Sequence 16689 BP; 3888 A; 4422 C; 4743 G; 3615 T; 21 other;

gtgactccat gtctattacc cagggcttag gcaggagaca gatgggaaga ctgc4ggtgg 
^ CaCaCa gCaggttggg gaccagatgg gtctcccatg tgaagcac?c 
ttggctgtgt tattgaaaag aatcccgggg ttcatgaatt tggagagcgg agctttgttt 
cttaagaagc ggatcacaac ctgaagacca gaagcatggc ttcttgccla aLacaXag 
caggcacttt aagggaggga agggcaaggc aggaatttat gctgagtggg ttagctaagt 
gcacgtattc aactggttat agaaggagct atgaatattc atgjacagg? ggacacatgg 
acacacgcat gtgtgacaag caaacactca tttttttttt tttttgagL Igagtctt^c 

lal?tT°l C ■ 9 K t ! 9 T gCagtggcac Sfatctcagct cactgcaacc tctgcgtcct 
gggttcaagc cattctcctg cctcagcctc tccagtagct gggattacag gcatgcacca 
ccacgcccag ctaatttttg tgtttttcgt agagacgggg tttcaccatg ttggccaggc 
tggtctcgat ttcctgacct cgtgatccgc cctccttggc ctcccaaagt gctjggatta 
caggcgtgag ccaccgcgac tggccgcaaa cacattttac atgcatccca ccttcacttg 
SSSSS CttaaCa " t -Stscattc cattgaatgc attcgtatcg aaacacgaa^ 
aa 9 acac tca gtgcacagcc tctgtcaaca gccagaacca gtccgtggtc 
XXSS l** c *9S*9* aagttactga aatcagtctc ttgttcaact aatgctg?ag 
ttatgccttg tggaacaggg ggttcagcta gtcagcatct ggcaatggat ggctqcaata 
gttgtaatat tgcttatctt gaggccggtg cttgtttagc ?|ctagaaaa Sgaaaagcc 
ttgtggcagt tggaacatag tcgattatat aagtgtag^g a^gcgfggct taaccct?gc 
SSSE C ! ta ? 9tCtt gtttataatt tggtatctta ttgccacaaa cagttcat?c 
Ctga J CtCta ttgtcacagc tcctctggcc catgtggagt gcLagagat 
gccaagacta aaggtcaaag agagagagac acagagaaac agacaggcag agagacatga 
^!^ aCa9 agagactgag ttgggagtag agagagtagg agccaggcal aiacccaaag 
222? aaaataCCac aa 9 a tgcata gaaggaagtg caaagaaagg aiaaaagagJ 
tactcaca 9 c aaggtgcaga gacatggaga ccctataaga gatggaaatg 
III JS™* gagggggaac tgcgggcact aagagagaca tggagagaac taglLgata 
Sf fl ! cacagacaca gagacccaga gccaaaaaga gacagaaatg gggtcaLgg 
tggacagaaa tgcacagaga gactgtgagc ccccacagac acaaacgctg agaggggcag 
a «? Ca9a9 aCaCCCacag a aatgcagag acaaaaccac ccggaglcac a?gccSga! 
a ««! a 939 ggg * tcagaa agcgacagat gcagagagag cccaagacag acccagcaag 
f™" 9a9 aaatgcacag aaacacaaag accgagacca ccagagacag aggtacggai 
SSf" 9 ggtgaggaag ^ctgctctcc agggctggct tgaacctaga ac£caai|t? 
atggcttggc tgggaagcgg gagggggata ccaaaaaaga tcagcgttag atggaggtgg 
ggagggaggg taggtttcac cctaattgac accccaccct ccccctcccc ctcfccCIc 
alalllltll gCtttagtgc ctgttgtcac caggatggaa tcctggcggc agacaaaggg 
gtgggggtgg gaggaggaga caaaaagatg gacagagaca gaggaggaga ctaagaagta 
gagacagaca cacagaagta gagacagaca cacagagaca cacagatgga gagccacKa All 
gacaacagaa atgggggtcc agaaagagac ctagltagaa agagggaaca SSgaaSaa 222S 

-ttccttata gcccatgtgt cattagtlgg tccJccaagg IcctactSg 
tgccgggcat ggggacgtca aagcacaggg agctataagg gaggtagttg gagcaccaco 
ggagcctggg gtgtggggtg ggagcagggg cggggctcct gggccatgca gatggggttg 
gcatggggtt ggggaggggc ccacggaggc ttctggactc tgagtgtgga gccaggggca 
gcagggctgt ctctgaactc ccaggcttgt gtttgagcaa aigLSSc LcccSccc 
tctcccctcg gggcgggccc gagcctccag ctataaactc cccggagctt cagtgccctc 
agcaaggctc agcctcaaga ttcacagcat ctcagacaca gcctaggtaa ggggctctac 
ggagccctgt ggtcttcaaa aaccatgtcc ttgggggagg gggcaggagg caggtgggga 
agatgtcggg tgtggagaaa tggtgcagag gtaggtgggg caLcaacg gag|tc?ggg 
aacttgagag gctcagactt tagggggact caggatagag gggaaaatac taggatttgt 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 



2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 



gggttacaca aatcctaacc cggggctgcc aggacttgga tgtcccagaa taaaacctcc 
taqaatcttg gggttctggg aatctggggg aaatctcagc agtcagggat atctggactt 
ggSgtcaaa Stetg£t gtcaggattt ggggcctcag actctgaggg tatcaggaat 
tgggggggtc tcagaagaaa ttggggtttt ggtagtggag ggctcggaat ctgagacatt 
a?ga?SLa gcatgaaaac ttagagtgcc attatctggg gtatcatgaa ctgggcctcc 
gagtttgggt atcaagattt tgggggtctc atattcttga ggtttcagat tttaggaggt 
ctcaagacct gggggtggtc ttaggatgtc ccggcccata tctggactat caagaactgg 
gatttcagaa tccagagctg tgaggacttg ctggcctcag aaacacgggc atcataaatc 
ggggtctcag aatctgtggg tattgggatt tggggctagg caccctgtgg cctagggtgt 
ctcaaagact ttgaggtcta agaatgtagg ggtcagggtg aggtctacgg acagaaggtg 
ccctttatgc ccctgccccc tacgtgatgt atgggaaggt gaaaggggag acaggtgggg 
aggaaaccag ccagtggggt ggggcgagtg ggagagggga tgcgggtgac taattctccc 
agaagccccc cttcctcaaa tcccttctta ggaaatgtcc tcggtttcac catctatgac 
atccccccaa aatagctcct gggggtgggg cagctattgt cttcaggcca ctgtcccttc 
tcaaatgcct cttcctaatc ccaactggat caggttccca tggacttgtc ataagacaaa 
agaggacagc tgtgctgagg gggcagggtc tgcagcctcc tggctgtgcc aggaccacac 
ctaccaaggt ctgtcctcat gcatgcttta ggacagccgg ccccctccct cagacccaag 
agtccagacc tgagccctcc tccctcagac gcaggagtac aggcccccag ccccctcctc 
cctcagactc aggagtccag actcccagcc tctcctccct tagatccagg agtccaggcc 
ccagctcctc ctccctcaga cccaggaggc caggccccca gaccctcctc cctcagaccc 
aggcgtccag gcccgcggcc cctcctccct cagacccagg cgtccaggcc cctggcccct 
cctccctcag atccaggagt ccaagcctcc agcccctccc ccccnagacc caggggtcca 
ggcccgcggc ccctcctccd ddaagaccca ggggtccagg acaacccatc tcagtannim 
ggagtccaca nccaccctcc cacacacccc ccannnanat cctccctaag acccaggaat 
ccgaccccca gcccctcctc cctcagatcc aggagttcag gaccccagtc cctcctccct 
cagacccaga agtccaggcc ccagcccctc cttcctcaga cccaggagtc caggccccag 
ctcctcctcc ctcagaccca ggagtccagg ccccagctcc tcctctctca cacccaggag 
tccagacccc agctcctcct ctctcacacc caggagtcca gacccaggag tccaggcccc 
cagcccctcc tccctcacac ccaggagtct aggtcgcccc ctccttgggg ccccacccca 
ctgggcctct tctctctggt ccttaggccg caccaggatg tcggacaccg aggagcagga 
atatgaggag tgagtgctgc actgggaggg gttggggcgt gcagtggtgg agaggctctc 
ccaccccagg tctaactctc ccctccctcc tctctcccca cacccgtccg cagggagcag 
ccggaaggtg agtagggagt gggccggccc tatgggatgg ggacacgccc aggagctctt 
ggqaggggag gcgaggagtg tggcactttg gcggggaagg ggcggctcag ccctcgggcc 
ctcgcccgcc ctctccgggt ctggagcgtc tcctcgcgcc atccctgcac cgccaggggg 
aacgtgccca tcgcagccga ggccagaggg acccccgtcc agcagagact ggaaccctgg 
tctctggcca ccgcccccat ccctccccca gctcctaacg ccttctcttc tgtgccccca 
tctccagagg aggctgcgga ggaggaggag gaaggtgagg tcccggactc cgcaggtctg 
gagctggggg gtgggggggc ggggacgctg ggcccgggag gagggggcgg gatggggctc 
ccgacgqcct cggctcccgg tcctctaaca gcctctcacg tctcttccct ctctcccctc 
ccgcgcccgg gcgtggccgc tcccccctcg ctccccaaac cctcgctgct cctccctccc 
ctccccagcc cccgaagagc cggagccggt ggcagagcca ggtaaccgcg gctgcttcgc 
ttcccgagac cgatgtcggg ggcccggggg gacccccgat atcctcgggc cccaacctcc 
aagaccaccc ttccctccct ccgcccccat aacccgcccc caggcccggt cctttaagcc 
cctgagagct cctgcgccct ctcggggccg ggaagggaaa tggccccgca gtggctttaa 
ccctttccct tccttcttaa agggaccgac acccaagcct tcttccccag tctggaggaa 
tctttacaga agggttggat gtttgcggag aagcaccacc ccatcccccc ttatcttcgc 
ctccaacctt gtctgttatt ttctccccca tatacattca aaggttgggg cttggaattt 
aggattgcag aggaggcgag cgctgggagt cagggtctcc aaggtcgggc cgggggcggg 
ggggcggtgg ggcggtgggc agggggcgag ctggggagag agcgctgggg tcactgaatg 
aagacagagg ttgggacccc ctgcactgct gggtcctaat ggaggagagc aggagctctg 
ggcatataaa tagagaaagg tctgggggtt tgcactcctg ggtgtgcaga cagtgggggt 
qqqqggtccc acaccagttc tgaatgtata tctgatgcgt gtgaccctct tcctccttta 
cagccaccct gactttcctt ccctcttttc ttctccccat cccccctcct ccagaactgg 
actccagagt ccttccctgg tgtcctgggg gccgaggctt ggggggtccc tccccagggt 
^ atcaggcc tagcccctct cctctaaact cttgctaatt atctcttttt atcccccacc 
accaacagaa gaggaacgcc ccaaaccaag gtgagatcct acagggtggg gggccccagt 



2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 



catacatctt cctaaatatt ccctctaccc ccagagacaa ggtgtgagaa aggcagagat 6300 

ggagggagga aattgaaaca gcagtcgaca tttctcctcc tcctcctcct ccctccccct 6360 

ttccctcctc cctcctctcc ctcctcctcc ctcctcctcc tccacttcct cttcttcccc 642 0 

ttccccttct ccttctcctt ccccttcttc ttcatcgtgg tcatagtctc caccctgtta 6480 

cccaggctag agtgcagtga cccaatcctg gcccactgcc tcaacatctt gggctcaagc 6540 

tttttttttt ? r 9ta9 r ^accacag gcatgtgcca ccacacctlg 6600 

tttttttttt tttttctttt tgtagagatg aggtcttgct atgttgctta ggctggtctt 6660 

gaactcctat cctcaagcag ttctcccacc ttggcctccc aaagtgctgg gattacaggc 6720 

gcaagccacc atgcccagcc agtctgtttc cttctttcca cccagtggtc cctcttcccc 6780 

tcgcgccttt accatcaccc cagcctcctc totggcctcc ctgcctacaa cttgtggctc 6840 

atcctcctcc atgactcgag agagactttt ctaacaccag agctaaccct gcccctcctc 6900 

tgctcagaat ctttcctcaa ctccccacta ctcttaagac gcagtcaaag caactagcct 6960 

gcattcaaag ccttccccat ctggctccag cggacctttc cggcatctgc cttccccact 7020 

?J?oS«n aCa " 9 = 9tttCa tggttcteta gocatgcctg tgctttcata cccctgagcc 7080 

tttgcacaca ctgtgccctc tgcctgcgca gcggcaggtg ggaggatcac ttgaggatag 7140 

gagttcaaga tcagcctaag taacaaaatg agatccccca catctctaca aaaaatttta 7200 

aaaaattagc cgggcacagt ggcacacacc tgtgattcca gctactcagg aggctgagga 7260 

aggaggattg ttcgaaccca ggatttggag gctgcagtga gctatgatta calcactgfa 7320 

ctccagcctg ggcaacagag caagaccctg tctctaaaac ataaataaaa ataagggaat 73 80 

ggggccgggc gcagtggttc atgcctgtaa ttccagcact ttgggaggct gaggggggcg 7 440 

gattacctaa ggtcaggagt ttgagaccag cccggtcaac atggglLc teSSSS 7500 

ctaaaaatac aaaagaatta gccaggtatt gtggtgggca cctgtaatcc tagctatttg 7560 

S^!S* 9 9 ° a " a9aat cgcttgaacc gggaggcaga ggttgcagtg agtcaagatc 7620 

EES?** acggcagcct gggcaacaag agtgaaactc catctcaaaa aaaaaaaaag 7680 

J££SS*!t 9999t " 999 aatggactga gagggattcc aaggcaaagg gacaggactt 7740 

ggggattgtt gggcaagggg agggaagcaa tggggacaac cccgaagggt ctctgggctg 78 00 

tllin^t ^ 99aCa9t " Saccate tccttctcag agaagttggt gagctccatc 7860 

aggacgtcat gtgtgggaag gtgggaaaag ttcaggagat acctgttagc ttgagtctgg 792 0 

ggcttaggag agagtctgga ccagagacag ttggcgagtc ctcagcagtt gatgatacct 7980 

gaagtcattg gagtgaatgg cattgatgat gagaaaaggg aaaaggaaag ctcactggca 8040 

9 t ??" f fc 9 tc 3gggc aggagatagg gatggaacag ccagaggggt gggaggcaag 8100 

ccggtattca ttcctgtaca atatggtaag ccatatttta ttcccaciL SLiLcS sl 60 

atggagctta tcgacttttc caagatggag atgacggccc tgctgtacac ttggccccct 8220 

ggcccagaga cccttgggga ttgtcctcac ctcttccctc cccttgccca gcatccccta 8280 

a ?^ 9tCt 9= ttcatct t ggaatcgctc ttcactgtct cagaaaccca gaaaagagaa 8340 

ttccaaggaa gtggcaggaa atcagagcag gctgagggct gagcctcaga gcggcaccac 8400 

ccaatagaaa tagatggcaa gccacacacg tcacttacaa tttcttacta aaggctgggc 8460 

aeSiS-^ aC9CCtata9 t<=<~agcact ctgggaggct gaggcgggag gatcac^ga 8520 

gctcagaagt tcaagaccac cctggacaac atagtgaaac gccatctctt tttttttttt 8580 

;^ a9a ^ 9a 9tt * ccctct gtcacccagg ctggtgtgcg atggcacgat ctcggctcac 8640 

tgcaatctct gcctccgggg ttcaagcgat tctcctgcct cagcctcctg agtagcaggg 8700 

attacaggca cgtaccacca tgcccagcta attttttttt ttaatttcta gtagagatgg 8760 

ggtttcacca tgatgcccaa gctggttttg aactcctgac ctcaagtgat tcgcccacct 8820 

aagtgctagg attacaggcg tgagccaccg cgcccaggct gtgaaacgcc 8880 

tltlllt aaaatttaaa aatggctggg cgcaggggct cctgcctgta atcccagcac 8940 

tttgggaggc tgaggcaggc agatcgcctg aggtcgggag ttcaagacca gcttggccgg 9000 

cacggtgaaa ccccgtctct actaaaaata caaaaattac ccaggcgtgg tggcaggcgc 9060 

ctgtaatccc aactactcgg gaagctgagg caggagaatc gct?gaatc? gigaggfgga 9120 

99 S" 9t9 9 fc 9 a 9 ca 9 a g atggcgccac tgcattccag cctgggtgac agagtgaggc 9180 

tctgtctcaa aaaaaaaaaa aaattcttac taggcatatt ttttaaagta caacaatata 9240 

ggaaaaatta attttcatag cacagttttt ttaatctgtt atataatgtt ttatcatttc 9300 

aacatgtaat cagtataaaa catgattagt gccgggcgcg gtggctcaca cctgtgatcc 9360 

cagcactttg ggaggccgag gcgggcggat cacgaggtca ggagatcgag accatcctgg 9420 

ctaacacagt gaagccccgt ctctactaaa aatacagaaa aattagccag gcttggtggt 9480 

gggcgcctgt agtcccagct actcgggagg ctgaggcagg agaatggcgt gaacccagga 9540 

ggcggagctt gcagtgagcc gaggtggcgc caccgcactc cagcctgggt gacagagcaa 9600 

gactccgtca aaaaaacaaa acaaaaccaa caacaaccaa aaaacatgag tagtgaggta 9660 



ttttaccttc ttttgtgtgt ttcactgaca gcccgcctca gctgggaccc tccacgttca 9720 
ggagcctccc ggggctggcg gctacctcac tggacggcac atcccggagg tgggaggtgg 9780 
ctcagggagg tcgggccact catggtctct gtgtttctgt ccactcagcc gccccgtggt 9840 
gcctcctttg atcccgccaa agatcccaga aggggagcgc gttgacttcg atgtaagtga 9900 
caagagaccc ctccgcaggc gcagttgcta gtctttaagg ggcctttgtg tcaatcatga 9960 
aaaggcgccg gccgggcgcg gggcctcatg cctgtaatcc cagcactttg ggaggctgag 10020 
gtgacccacc tgaggtcagg agttcgagac cagcctggtt aatatggtga aaccccgtct 10080 
ctactaaaaa tacaaaaaat agccgggcgt ggtggcaggt gtctgtaatc ccagctactc 10140 
gggaggctga ggcaggaaaa tcgcttgaac ctgggaggcg gaggtggcgg tgagccgaga 10200 
tcgcgccatt tcactccagc ctgggaggaa aaaaaaaaaa aaaggcgccg gtccccactc 10260 
cccactcccg tctttgggaa gcctgtcctt ggaagagctg attagtgtca aacacgaggc 10320 
attgctgcca cctgctggat accgtcctgg gaaacggtcc agttcaccat cctgcatggg 10380 
ggaggtgctg ggaggctgct gccccctcca gggtctccta ggacgggctg cccgtgtgtc 10440 
ctgcaggaca tccaccgcaa gcgcatggag aaagacctgc tggagctgca gacactcatc 10500 
gatgtacatt tcgagcagcg gaagaaggag gaagaggagc tggttgcctt gaaggagcgc 10560 
attgtgagcc gagagtccgg gttccccccg gtcttcctcc ctccatgtgg atcccttgca 10620 
tcttgggaga tgcagataat agttttcctc ctagtacaga gctgagcctt aggctttcgc 10680 
gaattcaccc aagtcggtgg ccacactcca atctgtttat tagcctactc tggggaagga 10740 
agactggggg tacgtccctg caccccctta tgcttctccg tttcccagga gcggcgccgg 10800 
tcagagagag ccgagcaaca gcgcttcaga actgagaagg aacgcgaacg tcaggctaag 10860 
ctggcggtgg gtgcctcccc tgccctgaga gcccaaatgt tacttcttca gccggatgcc 10920 
cattttgtta ttattattat tattattatt attattacta ttattmttat tctttgaaac 10980 
ggagtctagc tctgtcgccc aggctggagt gcagtggcac gatctcagct cactgtaacc 11040 
tctgccttcc aggttcaagc gaatcttctg cctcagcctc cccagtagct gggactacag 11100 
gtgcgcacca ccacgcccgg ctaaattttg tatttttagt agagatgggg tgtcaccatg 11160 
ttggccagga tggtcttgat cttctgacca catgatccgc ccacctcagc cttccaaagt 11220 
gctgggatta caggtgtgag ccaccgcatc cggcctatta ttatttttta ttcgtttatt 11280 
tggaaatagg gtcttgctct gtcacccagg ctgaagtgca gtggtgtgat cctagctcac 11340 
tataactagg acctcctggg ctcaaatgat tttcccacct cagcctccag agtcgctgga 11400 
actatatags stgcgccact ctgccccact agtttttttt atttttttat tttttgtaga 11460 
gacagcattt tgccatgttg tccaggctgg tcttcaactc ctaagctcaa gcaatccacc 11520 
tgctttcacc tcccaaagtg ctgggatgac aggcatgagc catcgtgccc ggcctggatt 11580 
ctccattttc ttntxittccc ttttttttta attttaattt tttttttttc tgagacagtc 11640 
tcgctctgtc acccaggctg gagtgcagtg acgcgatctc agctcactgc aacctccgcc 11700 
tcctggttca agcaattccc ctgcctcagc ctcctgagta gctgggatta caggcacctg 11760 
ccaccaggct cggctaattt ttgtattttt agtagaaatg gggtttctcc atgttggttc 11820 
aggctggtct tgaactcctg acctcaggtg attcaccccc ttggcctccc aaagtgctag 11880 
gattacaggc atgagccacc atgcctggcc attgtcatca ttattactat tatnatnatt 11940 
ttttttttat ttgagatagg gtctttctat gttgcccacg ttgttcccaa actcctgggc 12000 
tcaagtgatc ctcctgcctc agcctcccga gtagctggga ttacagccac ctgcccgtca 12060 
cccagctgtg aaggggattt tgcagatgtg attaaattaa agatcttgag atgggaaggt 12120 
tgtcctggat tatctgtggg gaagggcaag atcatcacaa gggtccttat aagaggaagg 12180 
ccagagggtc agactgagag atttgaagat gctgcatggc tgcctctgaa gatgaaagaa 12240 
ggtccatggg cccagacatg caggcagcag ctggaaaagg gaagggaatg aattctcccc 12300 
tagaacctcc agaataaatt ggttctgttc acaacttgat tccagcccag ggggaccaat 12360 
ttcagatgtc tgatctgcag agctgtaaga taacaaatct gcattgtttt tctgccacta 12420 
aatttgcaaa ttatagcagt gataggaaac taagtttagg cgcgatggct gacgcctgta 12480 
atcccagcac tttgggagac cgagacaagt ggatcacctg aggccaggag tttgagacca 12540 
gcctggccaa catggtgaaa ccctgtctgt actaaaaata caaaaattag ctggtaatgg 12600 
tggcacatgt ctgtaatccc agttacttgg gaggccgagg caggagaatc acttgaaccc 12660 
aggaggtggg ggttgcagtg aacagagcag agattgcacc actgcactcc agcctgagtg 12720 
acagagcgag actccatctc aaaaacagaa aggaaactaa ttcaggtacg gagtgctggg 12780 
tgtacaaaaa gcctcatgtc caccataagg agacggggct cagcctggac aaaccactgt 12840 
ttctggaaca ttcaatgaag agtttctcga atgtcgtaat gccttcgctc aatattccag 12900 
aaccccttcc tcaggctcaa ggccatcagc ctctttaatc tccccagtcc ctggtcttat 12960 
caccagttac ttcccttgac cccctcaaaa cagacattct catatcctga gactaagggc 13020 
gactgtggcc cagacaggct gagcatctgg agtgaggtcg tacagcagag ttcactatcg 13080 



gctatttcag ggtggagaca tgcagaaaaa gcaggcattt cttcttctcc ttcatttcga 13140 
ttttttctgg gaggatatta agacccagag aagggggcca ggtgcggtgg ctcacgcctg 13200 
taatcccagc actccgggag gccgaggcag gcggatcact tgaggtcagt agtttgagac 13260 
cagcctggcc aacatggtga aaccgcgtct ccactaaaaa tacaaaaatt agttgggcgc 13320 
ggtggcgggt gcctgtaatc ccagctactc aggaggctga ggctggagaa ttgcttaaac 13380 
ccgggaggcg gaggttgcag tgagccaaga tcgcaccact gcactccagc ctgagcaaca 13440 
gagcgagagt ccgtctcaaa aaaaaaaaaa aacagagggg aagggacttg taggtggtca 13500 
cacggggaga cgtggcaggg ccgggagggg gtacccctgc gtttctgctg gttctgatgt 13560 
cccagcagcc tctgcggcac tgggtctgct gatccctgtg ccccagttcc acatcccagg 13620 
gccctcctcc ctcagtccta caggggagtc catgaggtta accccttact tcccacagga 13680 
ggagaagatg aggaaggaag aggaagaggc caagaagcgg gcagaggatg atgccaagaa 13740 
aaagaaggtg ctgtccaaca tgggggccca ttttggcggc tacctggtca aggtgagtca 13800 
gcctgcagga gcctgaggtc tgagttctga gggggcaggg gctgggggcc agacctctgg 13860 
ttcctaacgg aggaggggct gaggcctgga ttcctgggtc tgagggagga ggggctgggc 13920 
cctgaactcc tgggtctgag ggaggaggag ctggggtctg gacgcctggg tctganggag 13980 
gaggggctgg ggtcctggac tcctgggctg agggaggagg ggctgagggc ctggacgcct 14040 
gggtctgaag aaggaggggc tgggggccag acttctggtt cctaatgcga gagggtctgg 14100 
ggcttggact cagtctgaga ggaggggctg ggggcctgga ctcctgggtc tgagggagga 14160 
ggggctgggg gcctgcactc ctgggtctga gggaggaggg gctcggggcc tgcaatcctg 14220 
ggtctgaagg aggaggggct gggggtctgg actcctgggc tgagggagga ggggctgggg 14280 
tctggactcc tgggctgagg gaggaggggc tgggggccag acttctggtt cctaatgcga 14340 
gagggtctgg ggcttggact cagtctaaga gaagggactg ggggtctgga ctcctcctgg 14400 
atctgaggga ggagggagtg gggctggact cctaggtctg aaggagatgg gcctggggtg 14460 
cagacctgtg gtcccaacat ggaaatggtc tgtgggatcc gacgtgtggg tcccgacacg 14520 
gcagtgctct gtgggatgtg gcccccagtt tactgttccg cctcccccac cctcaggcag 14580 
aacagaagcg tggtaagcgg cagacggggc gggagatgaa ggtgcgcatc ctctccgagc 14640 
gtaagaagcc tctggacatt gactacatgg gggaggaaca gctccggtag gtgctgtggg 14700 
gggctcgggg tgtaggggcc tgggtgtgag acagggggat aaaggagctc acacccagat 14760 
ctgggtgggc agatactccg ggtctcagct ccagtgctat gtgactttgg gcaagtggct 14820 
tcccctctct gggcctctgc tgccttcttt gtgaaacatg tcactgaacc cacttttcgt 14880 
ggctgccgtg gcctgtagca tctgacaggt gttaccaagg aagccagaca agatgacttc 14940 
cctcccccaa gtttaggaga ttttgttaac tccgtgatgt gccttggagc tcaggatggg 15000 
ctgtgccttt tacctaaatt caagccagga atgcgggtga gaattttcca gctgagtccc 15060 
tcgcatctat agccaagaca gcctgcctct catgataagc tacaggacgg caggctggtg 15120 
ggggcttgcc ttcaggagcc cgcctgtttc tccatgttct ctggcctggg ctgtggctca 15180 
gaaacttcaa ccaccccaga gtggcgccgt gatcgggttt gccgcttgca ccgaaagccc 15240 
ctgcccaggg ctcagtgcca gctttgccgg ctactttgtg atcttgcgca cgttatctga 15300 
cttctctggg gctcccattc ctcattggca gaacaaggat ataatcacaa aattgacctc 15360 
acagagctgt gaaagggcag tgagatagta ataataatta caggctgggc ccggtggctc 15420 
atgcctgtaa ttccagcact ttgggaggct aagatgggag gattgcttga ggccaggagt 15480 
tggagaccag cctgggcaat gtagtgagac cccatcctta aaaaaataat aattacagca 15540 
gcagcttcct gtggagggtt tacaaccctg tgaggtagat actattatta tccttgtgtt 15600 
aatagtgagg aacctgaggc acagagaggt taagtaaccc ttccagggtt gcacagctag 15660 
caagtggaag agctgggatt tgaacccggc agtctggtgc aatagtccgt actctttgct 15720 
agtgtgctgt aaagtggctt catggtaaat gtttaacagc tgctctccag ggggcaaaat 15780 
gtctagattt agcacatgcc cattcccagg gcgtaaatac ttccaccttg ctcctttcaa 15840 
actacccgtg tggggacact gagtgtggag acgggaaagg atgtgcacag caggctcctg 15900 
gaagccctct ggggggcgct gtgggccctc cttcctgtgc acatggttca tgtaaagtat 15960 
tgttggccgg gcgggcgcgg cggctcacac ctgtaatccc agcactttgg gaggctgagg 16020 
cgggcagatc acctgaggtc aggagaccag cctgaccaat atgatgaaat cccatctcta 16080 
ctaaaaatac aaaaattagc cgggcgtggt ggcaggcgcc tgtaatccca gctactctgg 16140 
aggctgaggc aggagaatca ctgaacccag gaggcggagg ttgcggtgag ccgagatggc 16200 
accactgcac tccagcctgg gcgacatgag cgaaactctg tcttaaaaaa aaaaaaaaaa 16260 
aaaaggctga ggtgagagga tcactagagc ctgagaggtt gagactgcgg tgagctgtga 16320 
tggtgtcact gcactccagc ctgggtgaca aagtgagacc ctatcaaaaa aaaaagaaaa 16380 
gaaagaaaag aaggaaggaa ggaaggaagg aaaggtaggt tttagtccag ggctggtatg 16440 
gcaattctac aaaatcagag accagactcc tatgttttct gcttcactca ctacttttag 16500 



Homo sapiens negative growth-regulatory protein MyD118 (MYD118) mRNA, 
complete cds. 

/translation="MTLEELVACDNAAQKMQTVTAAVEELLVAAQRQDRLTVGVYESAK 
LMNVDPDSVVLCLLAIDEEEEDDIALQIHFTLIQSFCCDNDINIVRVSGNARLAQLLGE 
PAETQGTTEARDLHCLPFLQNPHTDAWKSHGLVEVASYCEESRGNNQWVPYISLQER" 



ccgcatccac tgtggattat aattgcaaca tgacgctgga agagctcgtg gcgtgcgaca 60 
acgcggcgca gaagatgcag acggtgaccg ccgcggtgga ggagcttttg gtggccgctc 120 
agcgccagga tcgcctcaca gtgggggtgt acgagtcggc caagttgatg aatgtggacc 180 
cagacagcgt ggtcctctgc ctcttggcca ttgacgagga ggaggaggat gacatcgccc 240 
tgcaaatcca cttcacgctc atccagtcct tctgctgtga caacgacatc aacatcgtgc 300 
gggtgtcggg caatgcgcgc ctggcgcagc tcctgggaga gccggccgag acccagggca 360 
ccaccgaggc ccgagacctc cactgtcttc ccttcctaca gaaccctcac acggacgcct 420 
ggaagagcca cggcttggtg gaggtggcca gctactgcga agaaagccgg ggcaacaacc 480 
agtgggtccc ctacatctct cttcaggaac gctgaggccc ttcccagcag cagaatctgt 540 
tgagttgctg ccaacaaaca aaaaatacaa taaatatttg aaccccctcc cccccagcac 600 
aaccccccca aaacaaccca acccacgagg accatcgggg gcaggtcgtt ggagactgaa 660 
gagaaagaga gagaggagaa gggagtgagg ggccgctgcc gccttcccca t ca <;99 a 9f9 720 
tccagactgt ccactcgggg gtggagtgag actgactgca agccccaccc tccttgagac 780 
tggagctgag cgtctgcata cgagagactt ggttgaaact tggttggtcc ttgtctgcac 840 
cctcgacaag accacacttt gggacttggg agctggggct gaagttgctc tgtacccatg 900 
aactcccagt ttgcgaatta ataagagaca atctattttg ttacttgcac ttgttattcg 960 
aaccactgag agcgagatgg gaagcataga tatctatatt tttatttcta ctatgagggc 1020 
cttgtaataa atttctaaag cctcaaaaaa 1050 






yz12f12.s1 Soares_multiple_sclerosis_2NbHMSP Homo sapiens cDNA clone 
IMAGE:282863 3', mRNA sequence. 

tggagaagga aggacagttt ttcttcctcc aagagtacca atttgaccac tcccactaac 60 
ctcactcagc aaacaaaaca ggatgtagac ctggtttgct aaggagtttt aatgagttct 120 
gtttcctgaa attaacagtg attagttaca ccaagcaaga gaagatataa tgtctcgctt 180 
tcacatttgc aaagaatact atggctaacc ctcatcccct actgcgcatg caacacagcg 240 
tcggccctcc tgataccctc agctcttcac aaacgtggcg ttcatacagc ttgctcagct 300 
tgtcccagaa ggtccatttg gttcccaaag cacactcaag gttttgtgtt tgctttcatt 360 
ttctaagccc ctgaatttgc aagtaaagaa tcactgacta acagaatttt ggcacaatga 420 
ctggtttctt tccctcaatg aagatgncca ggtctgggtg tgaggagcac ctggcctcaa 480 
ctggctggtc cacgctggcc ttcagcatgg ccaataagct ctttcctggc tcgcntttga 540 
gaatgatctg tgctgggana cctccctaan ggatgaagg 579 



Homo sapiens synaptogyrin 3, mRNA (cDNA clone MGC:20003 IMAGE:4334996), 
complete cds. 

/translation="MEGASFGAGRAGAALDPVSFARRPQTLLRVASWVFSIAVFGPIVN 
EGYVNTDSGPELRCVFNGNAGACRFGVALGLGAFLACAAFLLLDVRFQQISSVRDRRRA 
VLLDLGFSGLWSFLWFVGFCFLTNQWQRTAPGPATTQAGDAARAAIAFSFFSILSWVAL 
TVKALQRFRLGTDMSLFATEQLSTGASQAYPGYPVGSGVEGTETYQSPPFTETLDTSPK 
GYQVPAY" 



cagcggcctc gggcggggcc ggccggacgg acaggcggac agaaggcgcc aggggcgcgc 60 
gtcccgcccg ggccggccat ggagggcgcc tccttcggcg cgggccgcgc aggggccgcc 120 
ctggaccccg tgagctttgc gcggcggccc cagaccctgc tccgggtcgc gtcctgggtg 180 
ttctccatcg ccgtcttcgg gcccatcgtc aacgagggct acgtgaacac cgacagcggc 240 
cccgagctgc gctgcgtgtt caacgggaac gcgggcgcct gccgcttcgg cgtcgcgctg 300 
ggcctcggag ccttcctcgc ctgcgccgcc ttcctgctgc tcgatgtgcg cttccagcaa 360 
atcagcagcg tccgcgaccg ccggcgcgcg gtgttgctgg acctgggctt ctcaggactc 420 
tggtccttcc tgtggttcgt gggcttctgc ttcctcacca atcagtggca gcgcacggcg 480 
ccagggccgg ccacgacgca ggcgggggac gcggcgcggg ccgccatcgc cttcagcttc 540 
ttctccatcc tcagctgggt ggcgctcacc gtgaaggccc tgcagcggtt ccgcctgggc 600 
accgacatgt cactcttcgc caccgaacag ctgagcaccg gggcgagcca ggcctacccc 660 
ggctatccgg tgggcagcgg cgtggagggc accgagacct accagagccc gcccttcacc 720 
gagaccctgg acaccagccc caaagggtac caggtgcccg cctactagcg gctggcaggc 780 
acagaccagg gctccaaggc caccccacca acgcaggccc cagggtctcc gggacctccc 840 
ttgggtcctt ccagctcagt gccgcggaca gagtaggtgg ccgctttgcg ccatccgggg 900 
ccaagagggg gtggacccgc gtgtctgggc tgcccctgcc aagttccccc agtccctcag 960 
cacctggccc caggactgag gtcctgagaa ggggatagca ctgcccagga cgtgtgtccc 1020 
tagcctggaa tggactggcc tggggaaggc tttcccctct tgggccacac ctgctcactc 1080 
tggggttggg ggtccagctg ccctctacga tcaggtgcag gggctgccca ggacaaagcg 1140 
gggacagggg aaagacacca ccctcgcccc aagactgggg atcctggcca ctgttcccat 1200 
cccatgtccc tgtgggtagt gactgtctcg tttctgtcat ggtggtgcgt cccgtccgga 1260 
gccactctcc actttctctc acaggctgct agaacagccc agccctgtca gtgttgtgat 1320 
catggtccag tcttcgggtt tcacctccta gtactccaca agctgctcct ctctctgtgg 1380 
ccccggcccc tgcccaggtg tgggtggttc tggccaggaa ggcacaaggt agctgtgggc 1440 
caagacacca gccctgtcct agcccttcag taagaccttg ccaggagagg agaaggatgc 1500 
ctgggtgcca ggcaagacaa gcccctcagc aggagagagg cccagaggct ccagctggcc 1560 
accgtgcccc acaagatggc ccctgtgtgg ttccctttac cttggcttcc tggcccagtc 1620 
cctgcctctc cacctgcacc ctgcttcctg gcccagtccc aggttggagt ccctctgcat 1680 
agctgactac tcatgcattg ctcaaagctg gcttttcaca ttaagtcaac accaaacgtg 1740 
gttgccacat ttcatcagac agacacctcc ctctggagat gcagttgagt gacaaccttg 1800 
ttacattgta gcctagacca attctgtgtg gatatttaag tgaacatgtt tacaattttt 1860 
gtatatatca ctctctccct ctcctgaaag accagagatt gtgtattttc agtgtcccat 1920 
gttccgactg caccttcttt acaataaaga ctgtaactga gctgactgtg aaaaaaaaaa 1980 
aaaaaaaaaa aaaaaa 1996 



DE Human 14 kd lectin mRNA, complete cds 
XX 

KW lectin. 

FT /translation="MACGLVASNLNLKPGECLRVRGEVAPDAKSFVLNLGKDSNNLCLH 
FNPRFNAHGDANTIVCNSKDGGAWGTEQREAVFPFQPGSVAEVCITFDQANLTVKLPDG 
YEFKFPNRLNLEAINYMAADGDFKIKCVAFD" 

cttctgacag ctggtgcgcc tgcccgggaa catcctcctg gactcaatca tggcttgtgg 60 

tctggtcgcc agcaacctga atctcaaacc tggagagtgc cttcgagtgc gaggcgaggt 120 

ggctcctgac gctaagagct tcgtgctgaa cctgggcaaa gacagcaaca acctgtgcct 180 

gcacttcaac cctcgcttca acgcccacgg cgacgccaac accatcgtgt gcaacagcaa 240 

ggacggcggg gcctggggga ccgagcagcg ggaggctgtc tttcccttcc agcctggaag 300 

tgttgcagag gtgtgcatca ccttcgacca ggccaacctg accgtcaagc tgccagatgg 360 

atacgaattc aagttcccca accgcctcaa cctggaggcc atcaactaca tggcagctga 420 

cggtgacttc aagatcaaat gtgtggcctt tgactgaaat cagccagccc atggccccca 480 
ataaaggcag ctgcctctgc tcccctg 507 



Homo sapiens monocarboxylate transporter 2 (MCT2) mRNA, complete cds. 

/translation="MPPMPSAPPVHPPPDGGWGWIVVGATFISIGFSYAFPKAVTVFFK 
EIQQIFHTTYSEIAWISSIMLAVMYAGGPVSSVLVNKYGSRPVVIAGGLLCCLGMVLAS 
FSSSVVQLYLTMGFITGLGLAFNLQPALTIIGKYFYRKRPMANGLAMAGNPVFLSSLAP 
FNQYLFNTFGWKGSFLILGSLLLNACVAGSLMRPLGPNQTTSKSKNKTGKTEDDSSPKK 
IKTKKSTWEKVNKYLDFSLFKHRGFLIYLSGNVIMFLGFFAPIIFPAPYAKDQGIDEYS 
AAFLLSVMAFVDMFARPSVGLIANSKYIRPRIQYFFSFAIMFNGVCHLLCPLAQDYTSL 
VLYAVFFGLGFGSVSSVLFETLMDLVGAPRFSSAVGLVTIVECGPVLLGPPLAGKLVDL 
TGEYKYMYMSCGAIVVAASVWLLIGNAINYRLLAKERKEENARQKTRESEPLSKSKHSE 
DVNVKVSNAQSVTSERETNI" 

cgggcgccca ccctgcgcca gagaccagat aaagatcaat cttaagatgt gatactttcc 60 
tgtgaaacct gaaacaaggt gatctgggga accaaagact ctgggactct tggtgccaac 120 
agagttactc tgttacttga atttccacta gaggagcaga aatgccacca atgccaagtg 180 
ccccacctgt gcatccacct ccagatggag gatggggttg gattgtggtt ggagcaactt 240 
ttatctccat tggattttcc tatgcattcc ccaaagctgt caccgtattc ttcaaagaaa 300 
ttcagcaaat attccacact acctacagtg aaatagcatg gatttcatcc attatgctgg 360 
ctgttatgta cgcaggaggt cctgtaagta gtgttttggt gaataaatac ggcagccggc 420 
cggtggtgat agcaggaggc ttattatgct gtcttggaat ggtgttggcc tcctttagta 480 
gcagcgtggt acagctgtac ctcactatgg gattcattac aggtttaggt ttagccttca 540 
acctgcaacc cgccttaacc ataattggca aatacttcta taggaagcga cccatggcaa 600 
atggattggc catggcagga aatcctgttt tcttaagttc attggctcct ttcaatcagt 660 
acctttttaa tacttttggc tggaaaggaa gcttcctgat tttgggaagt ctacttttga 720 
atgcctgtgt ggctggttcc ctcatgagac cccttggacc caatcaaacc acttctaagt 780 
ctaaaaataa gactggcaaa acagaagatg attcaagccc aaagaaaatc aaaacgaaga 840 
aatcaacttg ggaaaaagtt aataagtatt tagatttctc cctttttaag catagaggat 900 
ttctgatata tctgtctgga aatgtcatta tgttcctagg tttttttgcc cccattatat 960 
tcccggctcc atatgctaaa gaccaaggaa ttgatgagta ctcggcagct tttctgctat 1020 
ctgttatggc tttcgttgat atgtttgcta ggccttctgt aggattaatt gcaaactcca 1080 
aatatattcg acctcgaatt cagtacttct tcagttttgc aatcatgttc aatggagtgt 1140 
gtcacctctt gtgcccactg gcacaggact acacaagcct ggtattatat gctgtatttt 1200 
ttggccttgg atttgggagt gttagcagtg ttctctttga aactctcatg gacctcgtgg 1260 
gtgcaccaag attttccagt gccgtcggac ttgtcacaat tgtggagtgt ggcccagttc 1320 
ttcttggccc tcctcttgca ggtaaattgg tggatttaac tggagaatat aaatacatgt 1380 
acatgtcctg tggggctatt gtggtagcag caagcgtgtg gctgctcatt ggcaatgcta 1440 
tcaactatag attgcttgca aaggaaagga aggaggaaaa tgcaaggcag aagaccagag 1500 
aatctgaacc cttgagcaaa tctaaacatt cggaagatgt taacgtcaaa gtttcaaatg 1560 
cacagagtgt aacctcagaa agagaaacta acatttaaca agaatcacat ctctgatttc 1620 
agtgtttatg actttatcta ggagtttgtt tttcattttg tttttttaaa gtattagaaa 1680 
aggttttagc tgaaatgagg agtcacaatt aaggatggag gtgatatttt cctcaatggc 1740 
aattttaaat tagtttttaa aaacttactt atttgggtag ttaaattttg agattatgca 1800 
tagaaagaat ccatgctata ggtttatttc catacctgac tctgggtgtg gtggttaaaa 1860 
tactaatttt aaagtcttcc agtgactttc ggtcttggtt atatgga 1907 



DE H. sapiens mRNA for gonadotropin-releasing hormone receptor, splice variant. 
XX 

KW gonadotropin-releasing hormone receptor. 

FT /translation="MANSASPEQNQNHCSAINNSIPLMQGNLPTLTLSGKIRVTVTFFL 
FLLSATFNASFLLKLQKWTQKKEKGKKLSRMKLLLKHLTLANLLETLIVMPLDGMWNIT 
VQWYAGELLCKVLSYLKLFSMYAPAFMMVVISLDRSLAITRPLALKSNSKVGQSMVGLA 
WILSSVFAGPQLPLHHPSFHHADLQCKNHLHPDTGPSSGPPRTTTESVQEQYTKSTAED 
SKNDGCICHFIYCLLDSLLCPRNLVLV" 

atggcaaaca gtgcctctcc tgaacagaat caaaatcact gttcagccat caacaacagc 60 

atcccactga tgcagggcaa cctccccact ctgaccttgt ctggaaagat ccgagtgacg 120 

gttactttct tcctttttct gctctctgcg acctttaatg cttctttctt gttgaaactt 180 

cagaagtgga cacagaagaa agagaaaggg aaaaagctct caagaatgaa gctgctctta 240 

aaacatctga ccttagccaa cctgttggag actctgattg tcatgccact ggatgggatg 300 

tggaacatta cagtccaatg gtatgctgga gagttactct gcaaagttct cagttatcta 360 

aagcttttct ccatgtatgc cccagccttc atgatggtgg tgatcagcct ggaccgctcc 420 

ctggctatca cgaggcccct agctttgaaa agcaacagca aagtcggaca gtccatggtt 480 

ggcctggcct ggatcctcag tagtgtcttt gcaggaccac agctgcctct tcatcatccc 540 

tcttttcatc atgctgatct gcaatgcaaa aatcatcttc accctgacac gggtccttca 600 

tcaggacccc cacgaactac aactgaatca gtccaagaac aatataccaa gagcacggct 660 

gaagactcta aaaatgacgg ttgcatttgc cacttcattt actgtctgct ggactcccta 720 

ctatgtccta ggaatttggt attggtttga tcctgaaatg ttaaacaggt tgtcagaccc 780 

agtaaatcac ttcttctttc tctttgcctt tttaaaccca tgctttgatc cacttatcta 840 

tggatatttt tctctgtga 859 



Homo sapiens midline 1 (MID1) mRNA, complete cds. 

/translation="METLESELTCPICLELFEDPLLLPCAHSLCFNCAHRILVSHCATN 
ESVESITAFQCPTCRHVITLSQRGLDGLKRNVTLQNIIDRFQKASVSGPNSPSETRRER 
AFDANTMTSAEKVLCQFCDQDPAQDAVKTCVTCEVSYCDECLKATHPNKKPFTGHRLIE 
PIPDSHIRGLMCLEHEDEKVNMYCVTDDQLICALCKLVGRHRDHQVAALSERYDKLKQN 
LESNLTNLIKRNTELETLLAKLIQTCQHVEVNASRQEAKLTEECDLLIEIIQQRRQIIG 
TKIKEGKVMRLRKLAQQIANCKQCIERSASLISQAEHSLKENDHARFLQTAKNITERVS 
MATASSQVLIPEINLNDTFDTFALDFSREKKLLECLDYLTAPNPPTIREELCTASYDTI 
TVHWTSDDEFSVVSYELQYTIFTGQANVVSLCNSADSWMIVPNIKQNHYTVHGLQSGTK 
YIFMVKAINQAGSRSSEPGKLKTNSQPFKLDPKSAHRKLKVSHDNLTVERDESSSKKSH 
TPERFTSQGSYGVAGNVFIDSGRHYWEVVISGSTWYAIGLAYKSAPKHEWIGKNSASWA 
LCRCNNNWVVRHNSKEIPIEPAPHLRRVGILLDYDNGSIAFYDALNSIHLYTFDVAFAQ 
PVCPTFTVWNKCLTIITGLPIPDHLDCTEQLP" 



ctttttttgg ccgggccgca tgaatccggc cagcccaccc tgcttgaagg acctacaggt 60 
ttgtctcttc cagatcagaa ctgaggaaca aaaaccccca tcctgggaaa aatggggaag 120 
ctgatttcgc cgggttgctt ttgtcttgcg ggctcctgtc gggttcggtg tttccgctct 180 
gaagactgcg acgcgggctc cgatgcagct cgctccctgc cggatgggtc atgggattct 240 
aaacatgagg cagatagctg atcagcttcc ttgggttttg ctgatgacac aagagagctt 300 
tgcctgaaga tggaaacact ggagtcagaa ctgacctgcc ctatttgtct ggagctcttt 360 
gaggaccctc ttctactgcc ctgcgcacac agcctctgct tcaactgcgc ccaccgcatc 420 
ctagtatcac actgtgccac caacgagtct gtggagtcca tcaccgcctt ccagtgcccc 480 
acctgccggc atgtcatcac cctcagccag cgaggtctag acgggctcaa gcgcaacgtc 540 
accctacaga acatcatcga caggttccag aaagcatcag tgagcgggcc caactctccc 600 
agcgagaccc gtcgggagcg ggcctttgac gccaacacca tgacctccgc cgagaaggtc 660 
ctctgccagt tttgtgacca ggatcctgcc caggacgctg tgaagacctg tgtcacttgt 720 
gaagtatcct actgtgacga gtgcctgaaa gccactcacc cgaataagaa gccctttaca 780 
ggccatcgtc tgattgagcc aattccggac tctcacatcc gggggctgat gtgcttggag 840 
catgaggatg agaaggtgaa tatgtactgt gtgaccgatg accagttaat ctgtgccttg 900 
tgtaaactgg ttgggcggca ccgcgatcat caggtggcag ctttgagtga gcgctatgac 960 
aaattgaagc aaaacttaga gagtaacctc accaacctta ttaagaggaa cacagaactg 1020 
gagacccttt tggctaaact catccaaacc tgtcaacatg ttgaagtcaa tgcatcacgt 1080 
caagaagcca aattgacaga ggagtgtgat cttctcattg agatcattca gcaaagacga 1140 
cagattattg gaaccaagat caaagaaggg aaggtgatga ggcttcgcaa actggctcag 1200 
cagattgcaa actgcaaaca gtgcattgag cggtcagcat cactcatctc ccaagcggaa 1260 
cactctctga aggagaatga tcatgcgcgt ttcctacaga ctgctaagaa tatcaccgag 1320 
agagtctcca tggcaactgc atcctcccag gttctaattc ctgaaatcaa cctcaatgac 1380 
acatttgaca cctttgcctt agatttttcc cgagagaaga aactgctaga atgtctggat 1440 
taccttacag ctcccaaccc tcccacaatt agagaagagc tctgcacagc ttcatatgac 1500 
accatcactg tgcattggac ctccgatgat gagttcagcg tggtctccta cgagctccag 1560 
tacaccatat tcaccggaca agccaacgtc gttagtctgt gtaattcggc tgatagctgg 1620 
atgatagtac ccaacatcaa gcagaaccac tacacggtgc acggtctgca gagcggcacc 1680 
aagtacatct tcatggtcaa ggccatcaac caggcgggca gccgcagcag tgagcctggg 1740 
aagttgaaga caaacagcca accatttaaa ctggatccca aatctgctca tcgaaaactg 1800 
aaggtgtccc atgataactt gacagtagaa cgtgatgagt catcatccaa gaagagtcac 1860 
acacctgaac gcttcaccag ccaggggagc tatggagtag ctggaaatgt gtttattgat 1920 
agtggccggc attattggga agtggtcata agtggaagca catggtatgc cattggtctt 1980 
gcttacaaat cagccccgaa gcatgaatgg attgggaaga actctgcttc ctgggcgctc 2040 
tgccgctgca acaataactg ggtggtgaga cacaatagca aggaaatccc cattgagcct 2100 
gccccccacc tccggcgcgt gggcatcctg ctggactatg ataacggctc tatcgccttt 2160 
tatgatgctt tgaactccat ccacctctac accttcgacg tcgcatttgc gcagcctgtt 2220 
tgccccacct tcaccgtgtg gaacaagtgt ctgacgatta tcactgggct ccctatccca 2280 
gaccatttgg actgcacaga gcagctgccg tgagcgtctg gccacatgga gctgctttct 2340 
ggggaacagt aaggttcagg ccactattta ggggactgag aaagcacagg cttcatgagt 2400 
gtaatgaaat ctcaccagaa gtgtcccgaa atcggctcag atagggctca aaacaagaga 2460 
ttcctctcct tttactgtgt cttgtattaa gtacgggctt taataatttc tttaattttt 2520 



DE Homo sapiens midline 1 (MID1) mRNA, complete cds. 

FT /translation="METLESELTCPICLELFEDPLLLPCAHSLCFNCAHRILVSHCATN 
ESVESITAFQCPTCRHVITLSQRGLDGLKRNVTLQNIIDRFQKASVSGPNSPSETRRER 
AFDANTMTSAEKVLCQFCDQDPAQDAVKTCVTCEVSYCDECLKATHPNKKPFTGHRLIE 
PIPDSHIRGLMCLEHEDEKVNMYCVTDDQLICALCKLVGRHRDHQVAALSERYDKLKQN 
LESNLTNLIKRNTELETLLAKLIQTCQHVEVNASRQEAKLTEECDLLIEIIQQRRQIIG 
TKIKEGKVMRLRKLAQQIANCKQCIERSASLISQAEHSLKENDHARFLQTAKNITERVS 
MATASSQVLIPEINLNDTFDTFALDFSREKKLLECLDYLTAPNPPTIREELCTASYDTI 
TVHWTSDDEFSVVSYELQYTIFTGQANVVSLCNSADSWMIVPNIKQNHYTVHGLQSGTK 
YIFMVKAINQAGSRSSEPGKLKTNSQPFKLDPKSAHRKLKVSHDNLTVERDESSSKKSH 
TPERFTSQGSYGVAGNVFIDSGRHYWEVVISGSTWYAIGLAYKSAPKHEWIGKNSASWA 
LCRCNNNWVVRHNSKEIPIEPAPHLRRVGILLDYDNGSIAFYDALNSIHLYTFDVAFAQ 
PVCPTFTVWNKCLTIITGLPIPDHLDCTEQLP" 



ctttttttgg ccgggccgca tgaatccggc cagcccaccc tgcttgaagg acctacaggt 60 

ttgtctcttc cagatcagaa ctgaggaaca aaaaccccca tcctgggaaa aatggggaag 120 

ctgatttcgc cgggttgctt ttgtcttgcg ggctcctgtc gggttcggtg tttccgctct 180 

gaagactgcg acgcgggctc cgatgcagct cgctccctgc cggatgggtc atgggattct 240 

aaacatgagg cagatagctg atcagcttcc ttgggttttg ctgatgacac aagagagctt 300 

tgcctgaaga tggaaacact ggagtcagaa ctgacctgcc ctatttgtct ggagctcttt 360 

gaggaccctc ttctactgcc ctgcgcacac agcctctgct tcaactgcgc ccaccgcatc 420 

ctagtatcac actgtgccac caacgagtct gtggagtcca tcaccgcctt ccagtgcccc 480 

acctgccggc atgtcatcac cctcagccag cgaggtctag acgggctcaa gcgcaacgtc 540 

accctacaga acatcatcga caggttccag aaagcatcag tgagcgggcc caactctccc 600 

agcgagaccc gtcgggagcg ggcctttgac gccaacacca tgacctccgc cgagaaggtc 660 

ctctgccagt tttgtgacca ggatcctgcc caggacgctg tgaagacctg tgtcacttgt 720 

gaagtatcct actgtgacga gtgcctgaaa gccactcacc cgaataagaa gccctttaca 780 

ggccatcgtc tgattgagcc aattccggac tctcacatcc gggggctgat gtgcttggag 840 

catgaggatg agaaggtgaa tatgtactgt gtgaccgatg accagttaat ctgtgccttg 900 

tgtaaactgg ttgggcggca ccgcgatcat caggtggcag ctttgagtga gcgctatgac 960 

aaattgaagc aaaacttaga gagtaacctc accaacctta ttaagaggaa cacagaactg 1020 

gagacccttt tggctaaact catccaaacc tgtcaacatg ttgaagtcaa tgcatcacgt 1080 

caagaagcca aattgacaga ggagtgtgat cttctcattg agatcattca gcaaagacga 1140 

cagattattg gaaccaagat caaagaaggg aaggtgatga ggcttcgcaa actggctcag 1200 

cagattgcaa actgcaaaca gtgcattgag cggtcagcat cactcatctc ccaagcggaa 1260 

cactctctga aggagaatga tcatgcgcgt ttcctacaga ctgctaagaa tatcaccgag 1320 

agagtctcca tggcaactgc atcctcccag gttctaattc ctgaaatcaa cctcaatgac 1380 

acatttgaca cctttgcctt agatttttcc cgagagaaga aactgctaga atgtctggat 1440 

taccttacag ctcccaaccc tcccacaatt agagaagagc tctgcacagc ttcatatgac 1500 

accatcactg tgcattggac ctccgatgat gagttcagcg tggtctccta cgagctccag 1560 

tacaccatat tcaccggaca agccaacgtc gttagtctgt gtaattcggc tgatagctgg 1620 

atgatagtac ccaacatcaa gcagaaccac tacacggtgc acggtctgca gagcggcacc 1680 

aagtacatct tcatggtcaa ggccatcaac caggcgggca gccgcagcag tgagcctggg 1740 

aagttgaaga caaacagcca accatttaaa ctggatccca aatctgctca tcgaaaactg 1800 

aaggtgtccc atgataactt gacagtagaa cgtgatgagt catcatccaa gaagagtcac 1860 

acacctgaac gcttcaccag ccaggggagc tatggagtag ctggaaatgt gtttattgat 1920 

agtggccggc attattggga agtggtcata agtggaagca catggtatgc cattggtctt 1980 

gcttacaaat cagccccgaa gcatgaatgg attgggaaga actctgcttc ctgggcgctc 2040 

tgccgctgca acaataactg ggtggtgaga cacaatagca aggaaatccc cattgagcct 2100 

gccccccacc tccggcgcgt gggcatcctg ctggactatg ataacggctc tatcgccttt 2160 

tatgatgctt tgaactccat ccacctctac accttcgacg tcgcatttgc gcagcctgtt 2220 

tgccccacct tcaccgtgtg gaacaagtgt ctgacgatta tcactgggct ccctatccca 2280 

gaccatttgg actgcacaga gcagctgccg tgagcgtctg gccacatgga gctgctttct 2340 

ggggaacagt aaggttcagg ccactattta ggggactgag aaagcacagg cttcatgagt 2400 

gtaatgaaat ctcaccagaa gtgtcccgaa atcggctcag atagggctca aaacaagaga 2460 
ttcctctcct tttactgtgt cttgtattaa gtacgggctt taataatttc tttaattttt 2520 
ttgtatttag aggaaaatct atagattatt tataagagaa acataatcag gattacaact 2580 
tttaggaatt acttggtttt gcacattaag aggcccataa gtttatcagc tatttacaac 2640 
cttcatttca tcacaatctg tgggcttaca aaaaaacaaa aacttttgta gttttgtatg 2700 
ttactcatct tcttacctga tatcccatga tgatcccatg gtaggtcttc tcacctcgat 2760 
aatacataac aggatgtgtt tgaacctagt aggggaggaa acaggctttc ttactctggt 2820 
ttaatttgaa gtgttttaat tgtgatgtca aaaagttgta tcagatcaac taaaatggag 2880 
aacaagacag agaatgaaaa gagttgattt tggacctcgg accttgccgt ggctaaatct 2940 
ttaccttctc atagctgatg ggataatgtt ggaaagaaag gttgtgaatc ctttggccac 3000 
attttgccct gcttctctca gggttaaggg ttctggaaga acattaagaa tgagatgcaa 3060 
ttgaaaatag tcattttgaa tcctattgat tattcaaaaa ttcaggctga ttgtctttta 3120 
tcagaggtag gattctgttt tatagtatag aatctacttt atcccttcct tttaatagtt 3180 
cctttagacc tgtgaaattt cttcactaca tttaatagtt ctcctatttc ccgctccccc 3240 
atatcaattt tccttttgtc tccggggctg agtaaataaa catgttctgt cacaaatagc 3300 
accaccactt tggattgatt ttgctctcca ggacatcagc acatggccct gatcagcact 3360 
accacatcca aacataagtc actgaaaaac acttaatatt tatgagttgg taatgacaag 3420 
ggacattgta taaagtacta tttgctagat tcatgcctca aaagttatta taaacagacc 3480 
tttattaaac acatcttgaa agatgtagaa gtccctctat agtctagtat agtttacaat 3540 
agagttgtaa gaccaaaaaa aaaaaaaaaa aaaaa 3575 



Homo sapiens IL-1 receptor accessory protein mRNA, complete cds. 

/translation="MTLLWCVVSLYFYGILQSDASERCDDWGLDTMRQIQVFEDEPARI 
KCPLFEHPLKFNYSTAHSAGLTLIWYWTRQDRDLEEPINPRLPENRISKEKDVLWPRPT 
LLNDTGNYTCMLRNTTYCS KVAFPLEWQKDS CFNS PMKL PVHKLY 1 E YG I OR I TC PNV 
DGYFPS S VKPTI TWYNGC YKIQNFNNVIPEGMNLS FL IAL I SNNGNYTCWTYPENGRT 

WTIDGKKPDDITIDVTINESISHSRTEDETRTQILSIKKVTSEDLKRSYVCHARSAKGE 
^^KAAKVKQKVPAPRYTX^IACGFGATVLLVVlLIVVYHVYWIiEMVLFYPJ^HFGTDETI 
I^GKEYDIYVSYARNAEEEEFVLLTLRGVLENEFGYKLCIFDRDSLPGGIVTDETLSFI 
QKSRRLLVVLSPNYVLQGTQALLELKAGLENMASRGNINVILVQYKAVKETKVKELKRA 
KTVLTVI KWKGEKS KYPQGRFWKQLQVAMP VKKS PRRS S SDEQGLS YS SLKNV " 

tctcaaagga tgacacttct gtggtgtgta gtgagtctct acttttatgg aatcctgcaa 60
5X25? Ca9aaC9 f 9 cgatgactgg ggactagaca ccatgaggca aatccaagtg 120
tttgaagatg agccagctcg catcaagtgc ccactctttg aacacttctt gaaattcaac 180
tacagcacag cccattcagc tggccttact ctgatctggt attggactag gcaggaccgg 240
gaccttgagg agccaattaa cttccgcctc cccgagaacc gcatLgtal ggaglaagl? 300
gtgctgtggt tccggcccac tctcctcaat gacactggca actatacctg catgttaagg 360
attgcagcaa agttgcattt cccttggaag ttgttcaaaa agacagctg 420
ttcaattccc ccatgaaact cccagtgcat aaactgtata tagaatatgg cattcagagg 480
atcacttgtc caaatgtaga tggatatttt ccttccagtg tcaaaccglc tatcac^S 540
tatatgggct gttataaaat acagaatttt aataatgtaa tacccgaagg tatgaactta 600
agtttcctca ttgccttaat ttcaaataat ggaaattaca catgt|t4? tlStatcc? 660
gaaaatggac gtacgtttca tctcaccagg actctgactg taaaggta^t aggctctcca 720
aaaaatgcag tgccccctgt gatccattca cctaatgatc atgtggtcta tglgaaagaa 780
a^SS*?? a9 f actca ^ tccctgtacg gtctatttta gttttctgat ggaltct^gc 840
aatgaggttt ggtggaccat tgatggaaaa aaacctgatg acatcactat tgatgtcacc 900
attaacgaaa gtataagtca tagtagaaca gaagatgaaa caagaactca gattttgagc 960
atcaagaaag ttacctctga ggatctcaag cgcagctatg tctgtcatgc tagaagtgcc 1020
aaaggcgaag ttgccaaagc agccaaggtg aagcagaaag tgccagctcc aagatacaca 1080
cttgtggttt tggagccaca gtcctgctag tggtgattct cattgttgtt 1140
llr-ttltt actggctaga gatggtccta ttttaccggg ctcattttgg aacagatgaa 1200
accattttag atggaaaaga gtatgatatt tatgtatcct atgcaaggaa tgcggaagaa 1260
gaagaatttg tattactgac cctccgtgga gttttggaga a4aat??gg aLSaJctg 1320
aSaa-I acc * a 9 aca ? tctgcctggg ggaattgtca cagatgaglc tttgagftS 1380
attcagaaaa gcagacgcct cctggttgtt ctaagcccca actacgtgct ccagggaacc 1440
a a S^S CC ? 9a9 f Caa 9 ^ ct 99 ccta 3aaaatatgg cctctcglgg caacltcaac 1500
t 9 * aca 9 tacaa agctgtgaag gaaacgaagg tgaaagagct gaagagggct 1560
aagacggtgc tcacggtcat taaatggaaa ggggaaaaat ccaagtatcc acagg^clgg 1620
ttctggaagc agctgcaggt ggccatgcca gtgaagaaaa gtcccaggcg gtctlicag! 1680
gatgagcagg gcctctcgta ttcatctttg aaaaatgtat gaaaggaata atgaaLgga 1740



Homo sapiens clone FLB0708 mRNA sequence.

ccaagaggtg ggaacaatct aaatgtccaa cagatgaatg aatttttaaa aagtggtata 60
tatacataca ttgagatatt attcagcctt aaaaaagaag aaaaatcatg gccgggcgcg 120
gtggctcacg cctgtaatcc cagcactttg ggaggccgag acgagcgaat cacgaggtca 180
ggagatggag accatcctca ttaacatggt gaaactctgt ctctactaaa aatacaaaaa 240
aattagccgg gtttagtggt gggcgcctgt agtcccagct actcaggagg ctgaggcagg 300
agaatggcat gaacccggga ggcggagctt gcagtgagcc gagatcgcgc cactgcactc 360
cagcctgggc gacagagcga gactccgtct gaaaaaaaaa aaaagggaaa aatcctgcca 420
catgatatgg tatgggtcaa acttgaagac attaagctaa ataaaatgtc agtcacaaaa 480
agacaaatat tatatgattc cactcacatg aagtatcaag taatcaaact cacagaaaaa 540
gaaagtaaaa ttgtggttgc caatggttca gggtgaaaaa aagggagtta gtgtttaatg 600
gataagaggt tcagtttcgc aagacaaaag atttctggat atttgttgca caacagtatg 660
agtataatta atgctacaga actgttagaa aagagtctct ttcagattta gatactagaa 720
aatgtatgag taaaatacga tgtctgaaat ttgctttcaa ataatctgaa ggctggttgg 780
ggaagttgtt ggagtcatac atgaaataaa actggtatta gttgacaatc cttaaaactg 840
agtgggttta ttataccatt ctctctctac ttttgtgtat gtttgaaatt ttccatcata 900
aaggagtttt taaaaaccca acattatcaa aatgaaaaat aatcaataca agtgctggat 960
aagaaagtca aggaaatatc acagaatgta taatttaaaa gatttgctga ggtgtgtgta 1020
tcacctgagc tcaggagttc gagactagcc tggccaaaat ggcataaccc catctctaca 1080
taaaatacaa aaatcagctg ggaacactgg tgcacacctg tagtctcagc tactcaggag 1140
gctgagacac gagaatcact tgaacccagg aggcagaggt tgcagtgagc tgagatcacg 1200
ccattgcact ccagcctggg tgacagagac agactctgct tggtcacttt tttggcggag 1260
gagaatgcag ttaaaaagga catgtccccg gggattcgac tacacttctc aaagtgtact 1320
gctggccctc tgtatccatg ggttctgcat ctgtagattc gatcaactca actcctggct 1380
caatactgat ggaagtaatc tgcttaacaa tctcagaagg actgtgcaag tcaatgagtc 1440
gcttgtgaat tctcatctgg aaacgatccc acgtcttaga accttcacca caaggagttt 1500
ttcttgtagt gattctcaaa gtcttggtag gcattcgaac tggtcctttc actttgagat 1560
tcttttcttt tgcgcctctt atcaagtcag cacacacctt ttccaaggat tttacgttgc 1620
ggcttgttag ggtgattcga attcggtgaa ttgccacctc cggctccacg ggtgtttttc 1680
cggtatcctt aaaagccatg gctgttgcgc gcgggcttcc tgaccgactt gttcctcggc 1740
gagagcgaac agcggtgagt caggagcagg agcgtgcgga ccaaaaatcc tcagccctta 1800
cgaccgcgtc ttcctcaaaa aaaaa 1825



TABLE 2 



202825_at 
205844_at 
204808_s 
205264_at 
202687_s 
208323_s" 
206239_s 
207655_s 
220041_at 
203178_at 
218747_s 
217933_s 
214373_at 
205552_s 
211172_x_ 
204228_at
203787_at 
204994_at 
203567_s 
215464_s 
218280_x" 
AFFX-HUi 
219211_at 
219691_at 
217761_at 
214022_s 
218017_s" 
214290_s" 
216565_x 
204739_at 
AFFX-HUP 
200790_at 
202446_s 
203903_s~ 
AFFX-HUF 
219366_at 
206332_s 
202269_x" 
201601_x" 
AFFX-HU? 
202430_s 
208268_at
202388_at 
204259_at 
220084_at 
200887_s 
218943_s" 
219209_at 
209969_s 
208965_s~ 
215252_at 
208966_x 
_203372_sl_ 



B+ 

Signal Det 
116.6 A 

188.6 P 
134.5 P 

151 M 
100.1 P 
2738.1 P 
585.3 P 
98.9 P 

162.8 P 
26.8 P 
18.1 A 
482 P 

110.7 A 

283.7 P 

53.7 A 

150.5 A 

43.5 P 
174.3 P 
1 06.2. A. 

92.4 M 
2755 P 

177.9 P 

64.8 A 
83.8 P 
479 P 

1101.3 P 
48 A 
547.9 P 

179.8 P 

45.3 A 
399.3 P 

481.6 P 
982;6 P 
182.8 P 

35.6 A 
1265 A 
tl8.9.P 

295 A 
664.8 P 
126*9 P 
281.8 P 

<f3;3A : 
474.9.P 

445.8 P 

56.4 P 

583.7 P 
675 A 

114.9 P 
95.1 P 
43:9 A 
415 A 

98 p 
:^13-A 



B+ vs
Det p-value SLR
0.129639
0.001953
0.018555
0.056152
0.000244
0.000244
0.000244
0.018555
0.030273
0.030273
0.303711
0.001953
0.111572
0.000244
0.067627
0.095215
0.010742
0.00415
0.129639
0.056152
0.000732
0.012547
0.303711
0.000244
0.000732
0.000244
0.57417
0.000244
0.010742
0.080566
0.000225
0.001953
0.000244
0.000732
0.313723
0.080566
0.000732
0.171387
0.000244
0.000081
0.000244



G+ 



0.001953
0.037598
0.010742
0.000244
0.57417
0.00293
0.037598
0.111572
0.334473
0.001221
0.129639



-1 
-1 
-1 
-1 
-1 
-1 
-1 
-1 
-1 
-1 
-1 
-1 
-1 
-1 
-1 
-1 
-1 
-1 
-1 
-1.1
-1.1 
-1.1 
-1.1 
-1.1 
-1.1 
-1.1 
*14 
-1.1 

-.15 
-1.2 
-15 
-1.2 D 
-15 D 
-15 D 
-15 D 
-15 D 
-15 D 
-15 D 
-15 D 
-%.2D 
-T5 D 
-1:3 D 
-i.3 D 
-1:3 
-1.3 
-1.3 
-1^ 
-1.4 D 
-1.4 D 
-1 .4 ;D 
-155 -D 
-1:5 D 



Change 
D 
D 



D 
Q 
P 
D 
D 
D 
D 
D 
D 
D 
D 
D 
D 
D 
D 
O 
D 
O. 
D 
D 
D 
D 
D 
D 
D 
D 
D 
D 
D 



■D 
D 
D 
D 



Change p-value
0.999853
0.99998
0.999226
0.999308
0.99998
0.99998
0.99998
0.99997
0.99998
0.99987
0.999611
0.99998
0.999693
0.99998
0.998923
0.998923
0.996301
0.998664
0.999811
0.998923
0.99998
1
0.999886
0.99998
0.99998
0.99997
0.99987
0.99998
0.999973
0.999759
1
0.99998
0.99998
0.99998
0.99985
0.999135
0.99996
0.99775
0.99998
. .1'
0.99994
0.996959
0.99998
0.999833
0.99996
0.99998
0.999899
0.999954
0.99998
0.999654
0:9^751
0.99996
0.995927



AFFX-HU* 
210738_s. 
210163_at 
215447_at 
219352_at 
203908_at 
205345_at 
AFFX-r2-h 
203153_at 
213797_at 
206664_at 
202086_at 
216200_at 
214059_at 
205771_s 
204972_at 
218986_s_ 
207057_at 
214453_s 
215729_s 
211520_s. 
213293_s 
204439_at 
202664_at 
215241_at 
20461 5_x 
205128_x_ 
221760_at 
204044_at 
205939_at 
201749_at 
201626_at 
31637_s_J 
201627_s : 
213348_at 
213154_s. 
45714_at 
200599_s 
203252_at 
214581_x 
203207_s 
217168_s* 
218627_at 
218145_at 
200598_s 
220892_s 
212274_at 
213448_at 
40093_at 
212272_at 
205830_at 
212218_s! 
201005_at 
214152_at 
204058_at 
201790 s. 



61.3 P 

13.8 A 
9.2 M 

iXA A 

93.9 P 
71.5 P 

23.8 A 

74.9 A 
341.9 P 

69.3 A 

54.5 P 
167.8 P 
4 A 

78.5 P 
225;8 P 

190.4 P 
75.5 P 

9.2 A 

155.0 P 
212 P 

4.2 A 
. 83.8 P 

110.5 P 
1.8 A: 

13:6 A 
1342.2 P 
548.8 P 

598.4 P 

250.1 P 
125P 

265.7 P 
1377.6 IP 
882 P 
2066;8 P 

207.5 P 

153.2 P 

259.6 P 
2866.6 P 

323.6 P 

415.5 P 
. 296.6 P 
2143 J .P 

204.5 P 
2828.4 P 
1610.6 P 
1246.1 P 

2272 P 

125.3 P 
-378.7 P 

"95;6B 
:- : *7£'pV 

#7632#- 
24© P 
. 023Q P 
.. 837.4 P 



0.004998
0.129639
0.056152
0.432373
0.046143
0.001953
0.366211
0.129639
0.008057
0.219482
0.001953
0.010742
0.533936
0.000244
0.00415
0.00293
0.01416
0.432373
0.001221
0.01416
0.72583
0.023926
0.018555
0.432373
0.432373
0.001953
0.001221
0.000244
0.01416
0.000244
0.000732
0.000244
0.007543
0^00244- ; 

0:001953 

0.005859 
0:007543 
0:000244 
.0000732 
0000244 
•.6:000244; 
. '.QK300244 

O:000244 
0O0O244 
0.000244 
0.037598 
0023926 
0i00l354 
;• bife759S 
. < v© T i60293 

b:ob585j9:. 

O..O002# 
0000244 



-1.5 D
-1.6 D
-1.6 D
-1.6 D
-1.7 D
-1.7 D
-1.7 D
-1.7 D
-1.8 D
-1.8 D
-1.8 D
-1.8 D
-1.8 D
-1.9 D
-1.9 D
-1.9 D
-1.9 D
-1.9 D
-2.2 D
-2.4 D
-2.4 D
-2.5 D
-2.8 D
-2.8 D
-3 D

1 

1 

1 

1 

1 

I! 

1 

.•--•1':; 

-II 
1 l 
1,1 
1 1 
1.1 
11 

i i . 
ii, 
1 1 
1 1 
1 1 
1 1 
1 1 
i\, 
1 1 

ti- 
ll 
II 

1.1 



0.997968 
0.999973 
0096645 



0.999932 
0.999954 
0.999973 
0.99998 
0.999727 
0.99998 
0.99998 
0.999654 
0:99998 
0:99998 
0.99998 
0.999922 
0.995027 
0.99998 
0.999973 
0.997247 
0.999693 
0.99998; 
0.999611 
0.999382 
0.000027. 
0 000027 
0.00002 
0.000273 
0.000023 
0.000101 
000002 
0i6o027* 
r 0000068 
0O0O167; 

0.00002 
0.0000.08 
0.00002 
0.000027 
0:000046 
... 0100002! 
: 6:C 
0O0304-1 

oibbbbs 

0.00002 
0000023 
0:000046 
. 0.00.1486 
0:0000^12 
.0201 

0100002: 

oibfo 

. O.OO0I 
O:OOO0'68 

0:00002 



Probe set    Signal   Det  Det p-value  SLR    Change  Change p-value
209218_at    3285     P    0.000244     1      I       0.00002
218963_s     116      P    0.046143     1.1    I       0.00225
217790_s     66.7     P    0.023926     1.1    I       0.00225
204059_s     376.1    P    0.000732     1.1    I       0.000023
220451_s     123.5    P    0.008057     1.1    I       0.001486
217025_s     116.4    P    0.005859     1.1    I       0.000027
204205_at    73.9     P    0.00293      1.1    I       0.000189
210069_at    71.8     P    0.008057     1.1    I       0.001651
208116_s     296.2    P    0.000244     1.1    I       0.00002
221577_x_    780      P    0.000244     1.1    I       0.00002
210202_s_    141.7    P    0.00415      1.1    I       0.003699
212119_at    419.1    P    0.008057     1.1    I       0.000035
203875_at    76.3     P    0.00293      1.1    I       0.001336
214315_x     860.5    P    0.000244     1.1    I       0.00002
213802_at    50.6     P    0.01416      1.1    I       0.000774
213424_at    36.8     P    0.001221     1.1    I       0.000865
203675_at    165.5    P    0.000244     1.1    I       0.00006
202275_at    506.7    P    0.001221     1.1    I       0.00003
206683_at    116.8    P    0.001953     1.1    I       0.00002
221750_at    428.6    P    0.000244     1.2    I       0.00002
205127_at    77.7     P    0.037598     1.2    I       0.000023
208291_s     250.4    P    0.001953     1.2    I       0.000438
221485_at    888.9    P    0.000244     1.2    I       0.00002
208763_s     717.7    P    0.00415      1.2    I       0.00002
208937_s_    1120.4   P    0.000244     1.2    I       0.00002
221511_x_    569.9    P    0.001953     1.2    I       0.00002
214151_s     245.6    P    0.018555     1.2    I       0.00002
209850_s_    342.9    P    0.046143     1.2    I       0.00003
202842V      1130.8   P    0.000244     1.2    I       0.00002
201012_at    965.6    P    0.000244     1.2    I       0.00002
218025_s     85.8     P    0.00293      1.2    I       0.000438
206125_s_    270.5    P    0.030273     1.2    I       0.000438
204217_s_    217.4    P    0.010742     1.2    I       0.000035
212276_at    299      P    0.000244     1.2    I       0.000167
205822_s     405.7    P    0.000244     1.2    I       0.00002
218677_at    815.4    P    0.000244     1.2    I       0.00002
209146_at    1219.4   P    0.000244     1.2    I       0.00002
202557_at    157.3    P    0.000244     1.2    I       0.00003
202806_at    148.1    P    0.008057     1.2    I       0.000189
206574_s     131.3    P    0.00415      1.2    I       0.000241
221156_x_    241.4    P    0.000732     1.3    I       0.000023
209047_at    237.1    P    0.001953     1.3    I       0.00003
221701_s     403.1    P    0.010742     1.3    I       0.000046
204588_s     468.7    P    0.00415      1.3    I       0.00002
212120_at    565.7    P    0.000244     1.3    I       0.00002
202409_at    506.9    P    0.000244     1.3    I       0.00002
213716_s     438.8    P    0.008057     1.3    I       0.000114
218358_at    810.5    P    0.00293      1.3    I       0.000023
211031_s     252.7    P    0.00415      1.3    I       o.obboi
47560_at     343      P    0.003067     1.3    I       0.000191
222238_s     103.3    P    0.037598     1.3    I       0.000389
213577_at    802.1    P    0.000244     1.3    I       0.000023
211071_s     237.8    P    0.00293      1.3    I       0.00004
208608jb~    864.2    P    0.000244     1.4    I       0.00002
203165_s_    142.1    P    0.000244     1.4    I       0.00002
218681_s_    364J-    P    -0.804224    —1^1—          -W)00b23



Probe set    Signal   Det  Det p-value  SLR    Change  Change p-value
34408_at     506.7    P    0.000219     1.4    I       0
216449_x     733.5    P    0.000732     1.4    I       0.000035
205042_at    589.2    P    0.000244     1.4    I       0.000046
208121_s     282.2    P    0.000244     1.4    I       0.00002
206199_at    2294.8   P    0.000244     1.4    I       0.00002
217594_at    24.6     P    0.046143     1.4    I       0.004481
201631_s     1124.5   P    0.000244     1.5    I       0.00002
212345_s_    306.8    P    0.000244     1.5    I       0.00002
202539_s     704.5    P    0.000732     1.5    I       0.000052
213562_s_    892.1    P    0.000244     1.5    I       0.00002
219911_s     925.8    P    0.000244     1.5    I       0.00002
212944_at    767.8    P    0.000244     1.5    I       0.00002
217678_at    334.6    P    0.000244     1.5    I       0.000023
209504_s     430.9    P    0.00293      1.5    I       0.000027
208146%      175.9    P    0.005859     1.6    I       0.000027
206286_s     266      P    0.001953     1.6    I       0.000068
221679_s_    54       P    0.030273     1.6    I       0.002753
209189_at    256.1    P    0.008057     1.6    I       0.00002
211936_at    3417.5   P    0.000244     1.6    I       0.00002
204268_at    483.6    P    0.000732     1.6    I       0.00002
213164_at    945      P    0.000244     1.6    I       0.00002
200825_s_    18242    P    0.000244     1.6    I       0.00002
210181_s     58.4     P    0.030273     1.6    I       0.000046
222156_x_    153.9    P    0.000244     1.7    I       0.00003
212122_at    67.4     P    0.00415      1.7    I       0.000241
219091_s_    379      P    0.000244     1.7    I       0.000027
201841_s     1572.5   P    0.000244     1.7    I       0.00002
206198_s_    876.6    P    0.000244     1.7    I       0.00002
211848_s     974.2    P    0.000244     1.7    I       0.00002
209016_s_    56.2     P    0.030273     1.8    I       0.000167
209921_at    458.5    P    0.000732     1.8    I       0.00002
204540_at    1656.5   P    0.000244     1.8    I       0.00003
215058_at    68.5     P    0.018555     1.8    I       0.00249
202843_at    1BJ5.7   P    0.001221     1.9    I       0.00002
205319_at    201.2    P    0.001953     1.9    I       0.000088
202655_at    700.7    P    0.001953     2      I       0.000027
204773_at    72.3     P    0.01416      2.2    I       0.004073
202887_s     30082    P    0.000244     2.2    I       0.00002
204724_s_    292.3    P    0.010742     2.2    I       0.00002
201246_s     1065     P    0.000244     2.3    I       0.002032
208868_s_    60.7     P    0.010742     2.4    I       0.000618
209443_at    310.9    P    0.00293      2.5    I       0.000241
207761_s     81.7     P    0.018555     2.5    I       0.000035
208321_s_    86.8     P    0.018555     2.5    I       0.003355
213201_s_    575.3    P    0.000244     3.1    I       0.00002
207574_s     81.7     P    0.00293      3.1    I       0.000023
212702_s_    97.4     M    0.056152     3.3    I       0.000088
205691_at    114      M    0.056152     3.8    I       0.000273
201105_at    9161     P    0.000244     4.8    I       0.00002
210807_s     34.8     A    0.171387     -1.1   MD      0.994067
216341_s_    26       A    mm&2         -1.2   MD      0.994067
203637_s_    2%»      A    :pitii572i   -1.6   MD      0.994591
205227_at    ..;34^;  A    0329639      -1.8   MD      0.995075
216247_at    79.4     P    >6j0b293     1      MI      0.004925
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